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ABSTBACT 

Four experiments are reported vhich examine the role 
of phrase structure, memory load, ccncreteness of materials and other 
variables in the recall of meaningful English sentences* Several 
major findings are reported* Concreteness of the stimulus materials 
consistently is an aid to recall; although this is predicted from an 
imagery interpretation of the coding of concrete materials, some of 
the results are at oads with such a theory* Children as young as six 
years see a to chunk sentences for recall essentially the same as 
adults; however, the storage systems seemed to differ* No relation to 
functional knowledge of grammar was found. Besults from two 
experiments question the adequacy of a theory that depends heavily on 
surface structure of the sentence as a medium for storage* The 
general discussion proposes that at least four interactive processes 
are necessary to explain the obtained recall results* (Author) 
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Abstract 

Pour experiments are reported which examine the role of 
phrase structure^ mcruory load^ concreteness of materials and 
other variables in the recall of meaningful English 
sentences. Several major findings are reported; 
concreteness of the stimulus materials consistently is an 
axd to recall; although this is predicted from an imagery 
interpretation of the coding of concrete materials^ some of 
the results are at odds with such a theory. Children as 
youncj as six years seem to chunk sentences for recall 
essentially the same as adults; however ^ the storage systems 
seemed to differ. No relation to functional knowledge of 
gramxaar was found. 

Results from two experiments question the adequacy of a 
theory that depends heavily on surface structure of the 
sentence as a medium for storage. The general discussion 
proposes that at least four interactive processes are 
necessary to explain the obtained recall results. 
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Introduction 

One of the most ronarkable human abilities is the 
capacity to reincrriber things; that isr to store complicated 
infornaticn. There is an entire body of research literature 
in verbal learning examining the parameters and processes of 
free recall of words, paired-associate learning , and serial 
learning of lists of items. However, people more typically 
use their language abilities in the form of sentences; 
paradoxically, learning and memory research involving 
sentences is less widespread than that involving discrete 
words. 

Sentences are typically remembered better than strings 
of words (e.g., Marks & Miller, 1964); hence, the question 
arises: "What do people remember when they remember 
sentences?" It is an everyday observation that people often 
remember the essence of a sentence rather than the specific 
words of the sentence; as well, there is much experimental 
evidence to substantiate this view. Binet and Henri (1894) 
found that the specific characteristics of a sentence were 
not retained in memory. Mehler's (1963) Ss often confused 
active and passive voice (in recall) , butlnost of their 
mistakes were "raeaning-preserving" errors. Mehler suggested 
that Ss remember the kernel string and the transformational 
markers of a sentence independently and thar they are more 
likely to forget the markers. However, Fillenbaum*s (1966) 
Ss made lexical not transformational mistakes while 
preserving the underlying meaning. As well, Sachs (1967a, 
1967b) and V/anner (1968) report that Ss are better able to 
recognize changes in meaning than changes in wording alone. 
It must be noted that transformational changes also entail 
semantic changes. Thus, do Ss remember transformational 
changes or the covarying semantic changes? Even though 
actives and passives are logically synonymous, they are used 
for different purposes (Clark, 1965; Halliday, 1967a, 1967b; 
Hebb, 1969; Johnson-Laird, 1968; Morton, 1966; Prentice, 
1967; Romraetveit, 1968; Tannenbaum & Williams, 1968; Turner 
& Rommetveit, 1968). Thus, it would seem that semantic 
aspects are more important than syntactic ones. 

Why then is grammar important or functional in the 
learning of sentences? It is probably because grammar gives 
an organization to language that is not present in a string 
of words. As Handler (1967) has suggested, not only are 
memory and organization correlated, but organization is a 
prerequisite to memory. It has been posited that people use 
••meaningful units" in sentences as memorial aids (cf. 
Thorndike*s (1931) concept of "belongingness** ) . Johnson 
(196 8a) has proposed that "grammar provides Ss with a ready- 
made and reliable receding scheme" (p. 437) . The meaningful 
units in sentences seem to functionally resemble the 
••chunks" of information proposed by Miller (1956a, 1956b). 

1 
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Miller proposed a linit of 7 (±2) on human short-term memory 
capacity. Since we can rorr.omber more than seven pieces of 
infornation, Miller used the Unitization Hypothesis for an 
explanation and proposed chunking as a method to increase 
storage capacity. V.'e could not simply add seven more items 
of information to our memory for we would then lose the 
first seven. For storage capacity to increase, each 
remembered item must be enriched with more information. 
ThuSf chunks are "informationally-rich items"; Miller thus 
used the idea of chunking in two senses: one was the 
grouping of information into familiar units (chunks) . The 
second use involved the process of chunking chunks to form 
larger and larger chunks of information. 

In sentences, what defines the nature of a chunk? 
Much experimental evidence seems to point toward phrases or 
phraselike units as behaving much like chunks. Maclay and 
Osgood (1959) found that pauses in speech occur before 
lexically difficult items and at syntactic junctions , and 
that repetitions characteristically involve small phrase 
units. Miller (1963, in Garrett & Fodor, 1968) showed that 
when S^s copy text, they "hold" a phrase in memory, then 
consult the text again. Also, a series of experiments from 
MIT (Fodor & Bever, 1965; Garrett, Bever, & Fodor, 1966) 
showed that Ss displace "clicks" imbedded in sentences 
toward major phrase boundaries. Anglin and Miller (1968) 
found that paragraphs presented in phrase segments were 
remeinbered better than those presented with no phrase 
segmentation. Marks (1967) has shown that sentence 
inversions that interrupted major phrase boundaries had 
longer source-identification tines than those with phrase 
boundaries intact. This is similar to Anglin and Miller's 
(1968) finding that paragraphs segmented at phrase 
boundaries had higher recall scores than paragraphs where 
the segmentation did not correspond to the phrase 
boundaries. Epstein (1961, 1962, 1963) found that word 
endings indicative of phrase organization aided 
memorization; Lachman and Tuttle (1965) further specified 
the locus of this effect as being in the storage of the 
sentence not in perception or recall. Epstein (1967) 
segmented syntactically structured but anomalous material at 
phrase junctures; the cues that helped to chunk material 
also aided retention. Johnson (1965) showed that most 
errors of recall occurred at phrase boundaries indicating 
that Ss were using their knowledge of grammar to segment a 
sentence into functional subunits. 

There appears to be much evidence that syntax affects 
sentence retention by defining the phrase structure of the 
sentence. Thic in turn determines into how many chunks the 
sentence is broken. The retention of phrases might 
correspond to the "adopted chunks" noted by Tulving and 
Patkau (1962) ; they found that larger chunks could be 
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constructed v^ith higher approxinations to English (of. also 
Mcl'lulty, 1966). Epstein (1967) , on the other hand, has 
suggested that chunkin.j is facilitating for learning 
sequences only when scr/.e high degree of syntactic , structure 
is present. 

The basic position being offered here is that in memory 
for sentences r what is stored are chunks whose boundaries 
are derived frcn\ the surface structure of a sentence. As 
Wearing (1970) notes r chunks may be nested within one 
another (cf. Miller* s "super-chunks"). Wright (1968) has 
raised the possibility that we can conceptualize sentences 
in terms of both the nun^ier of chunks into which they are 
broken and the number of items of information (words) in 
each chunk. Thus, a sentence such as "The old man thanked 
the boy" could be described as a 3-1-2 sentence. But we 
cannot assume uniform difficulty of the items in a chunk. 
Roberts (19 67) stated that nouns in a sentence were less 
subject to forgetting than adjectives or adverbs. Matthews 
(1968)^ using Savin and Perchonock*s (1965) technique^ found 
that Ss seem to encode sentences by a complete encoding of 
syntactic forms with the emphasi'i on noun and verb 
components and subsidiary and less efficient storage of 
qualifiers. This suggests that nouns and verbs are chunked 
or receded firsts and various qualifiers are chunked 
separately or even discarded if memory load is too high. 
Martin, Roberts, and Collins (1968) claim that when a 
sentence is learned, its grammatical structure is 
determined, and then words are differentially processed into 
memory. The critical variable is their importance for the 
sentence. Perfetti (1969) offered a somewhat different 
approach to this issue. In order to understand his 
argument, it is first necessary to digress a little. Martin 
and Roberts (196 6), using Yngve's (1960) theory, proposed 
that the average depth (J) of a sentence was an important 
factor in sentence ret^.'tion. This was substantiated by 
Martin, Roberts, and Coxlins (1968) using single sentence 
recall. However, Martin and Roberts (1967) found no effect 
for d in recall. This led Perfetti to propose* that it was 
not average depth but number of lexical words that was 
important. In fact, he proposed it was the ratio of the 
number of lexical words to the rotal number of words that 
was of special importance. By his definition, lexical words 
included nouns, verbs, adjectives, etc. This study also 
showed nouns to be of greater importance in retention. In 
an unpublished study. Mile. Maxianne Berger and I have fo\md 
converging evidence using bilingual (French-English) Ss. 
These Ss had to translate sentences mixed in two languages 
(FrangTais) (e.g., ''The quiet priest thanked the homme 
French") . The results indicated that primary attention was 
paid to the language and position of the nouns. 

A recent paper (3egg & Paivio, 1969) has introduced a 

3 
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new factor in this argument. They propose that there are 
actually two codincj mechanisns operating. VJhich system is 
used depends largely on the nature of the stimulus material. 
If the sentences are concrete # nonverbal inages are evoked 
(cf. Ro.Tjr.ctvcit & Turner* 1967); thus, concrete material can 
be stored in parallel by nonverbal spatial imagery rather 
than sequentially, as in strings cf v;ords. Inforrnation 
contained in abstract material, on the other hand^ is linked 
more closely to the sequentially organized verbal units 
themselves. Therefore, with concrete material the meaning 
of the v;hole sentence is stored, but with abstract material 
specific words and phrases are retained. Remember, Sachs 
(1967a, 1967b) and Wanner (1968) have reported that Ss could 
recognize changes of meaning better than changes of words; 
both had used concrete material. Begg and Paivio predicted 
that in a recognition paradigm, Ss would be able to easily 
recognize meaning changes in concrete sentences but would 
find abstract material easier when spotting lexical changes. 
Their results agreed with this prediction. 

There are important consequences of such a dual-coding 
theory. If concrete sentences are coded as images, the 
order of ease of recall should he noiins (easiest) , 
adjectives, verbs, then adverbs. There is more semantic 
distortion as we progress from nouns to adverbs; adverbs 
have less power in establishing concrete sentential meaning. 
We may predict from this that nouns should function as the 
best retrievaJ cues (Horowitz & Prytulak, 1969; Lambert & 
Paivio, 1956; Paivio^ 1963). Another prediction of this 
theory would be that voice should be often confused in 
recall of concrete sentences but not in abstract. Support 
for the dual^coding hypothesis has com.e from a number of 
sources. Sasson and Fraisse (1972) , using a retroactive- 
interference paradigm, found that interpolated concrete 
sentences and pictures produced equal amounts of significant 
interference for recall of coiicrete sentences. Recall of 
abstract sentences was not affected by interpolated concrete 
sentences or pictures but was affected by interpolated 
abstract sentences. Sasson and Frais 3 view their results 
as supporting the hypothesis that the contents of concrete 
sentences are stored primarily as visual images. 
Philipchalk (1972), exploring Pompi and Lachman's (1967) 
notion of a surrogate structure as ^ substrate of connected 
discourse, asked Ss to recall concrete or abstract 
paragraphs that drffered on degree of thematicity (thematic 
or random order) . He found that Ss were most adept at 
recall when materials were thematTc and concrete. This 
finding obtained also after a two-week delay. Philipchalk 
interpreted this to mean that concrete thematic discourse 
has a unique property not shared by either abstract material 
or by non-thematic concrete discourse. Of course^ 
Philipchalk* s manipulation of the thematicity of the 



paragraphs also disrupted an important set of cues for 
organized roteution that have already been discussed — the 
phrase-structure divisiuns of the sentences. Nonetheless , 
it appears that disruption of these cues is more harmful for 
concrete than for abstract sentences. Following Epstein 
(1967), v;ho suggested that chunki.,j is a valuable aid only 
when soipe degree of syntactic structure is present, it may 
be that concrete sentences (when syntactic structures are 
present) are most aided by use of chunking strategies. If 
memory for abstract sentences is more tied to the left-to-* 
right order of words and relies more on word-to-word memory 
than global memory, randomization of the order of words may 
not be removing much of the information useful for recall. 
With concrete sentences on the other hand, the chunking may 
be of a quite different nature. If a surrogate structure is 
formed for thematic concrete discourse (image or schemata) , 
then removal of the phrase-structure chunking cues which 
permit the discovery of the large units which are the 
components of the surrogate structure will obviously impair 
the formation of the image. 

It must be kept in mind that there may be many 
possible ways for chunking to occur. Chunking may be 
induced or facilitated by the nature of the materials to be 
learned, by various experimenter manipulations, and on S*s 
own initiative. Epstein (1967) has suggested that a certain 
level of syntactic organization is essential to the use of 
chunking with verbal sequences. This leads to our present 
contention that the phrase-structure of a sentence offers a 
ready-made set of chunks and chunk boundaries for Ss. Of 
course, Johnson (1970) , by using physical spaces between 
letter groupings, has also presented ready-made chunks to 
his Ss. 

As well, a number of other manipulations have been used 
by experimenters to induce chunking. McLean and Gregg 
(1967) also used letter groupings to induce chunking. 
Loughery and Specter (1972) varied number of repetitions of 
groups of letters and found this to be a significant factor 
in recall. 

Finally, it is self-evident from everyday observation 
that we all attempt to chunk information when memory load 
would be heavy. Seven-digit phone numbers are usually 
rehearsed and communicated in chunks of 3 and 4 digits. 
Thompson and Roenker (IS 71) have even presented an analysis 
of how Ss "learn" a clustering strategy in an experimental 
task. 

In summary, the research to date would suggest that the 
facilitating effect of grammatical organization on sentence 
learning lies mainly in its organizing and segmenting 
properties. There are still many questions unanswered 
concerning the processes by which coding of sentential 
information takes place. If we assume that "chunking" of 
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some sort does take place in storage of sentences^ several 
major questions nay be asked: 

1. How are chunks labelled or identified in memory? 

2« How are chunks retrieved from memory storage? 

3. How are chunks kept ordered in menory? 

4. How does recall of one chunk affect the 
probability of recall of others? 

5. What is the developmental sequence that gives rise 
to the ability to chunk verbal information? 

The most general^ and ambitious^ attempt to ansv;er 
these kinds of questions is provided by the decoding- 
operation model of N. F. Johnson (1970^ 1972)* This model 
assuraes that there is a single mencrial representation 
(code) for the information contained in a chunk* As a 
coi'ollary of this viev;^ the model also assumes that there 
are no direct connections among the items in a chunk of 
information. Ec»ch item is independently tied to the chunk's 
code* 

This viev; of coding makes several firm predictions 
about learning and recall* If chunks are unitary^ they 
should be recovered from memory in an all-or-none fashion. 
Once a chunk code is recovered from memory^ all the 
information represented by a code should be available* 
Since codes may represent either other codes or response 
items ^ there exists the possibility of hierarchical ordering 
of sequential information* 

Johnson's decoding-operation model hypothesizes that 
(at the time of recall) recovers some code for an entire 
sequence and then decodes it into the codes (or items) at 
the next lower level of organization in the hierarchy. It 
is assumed that the codes recovered in each decoding 
operation are tagged with order information* A S will 
temporarily store all codes except for the one wTth temporal 
priority (information gathered from the order tag) ? this 
code v/ill be decoded and so on until a response item is 
produced* At this time will retrieve another code from 
memory and continue the process* An interesting possibility 
raised by the model is that the decoding and "read-out" 
functions may be performed m»ore or less in parallel; may 
be producing item X while decoding the code that represents 
item X + 1* A final assumption of the model is that Ss are 
relatively conservative and will terminate their recaXl 
attempts when uncertainty about a code arises* 

If Ss make recall decisions at chunk boundaries (codes 
are decoHed here) ^ it is at these points that Ss discover 
their iincertainty* Experimental results (e^g^T Johnson^ 
1970) indicate that there are clear TEP spikes on the 
transitions between the last item of one chunk and the first 
item of the next chimk* 

As well^ Johnson (1970) has found that the probability 
of terminating a recall attempt (as measured by the stop 
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TCP) increases v;ith increase in the size of the following 
chunk • Johnson (1970) has also shown that probability of 
terminatincj a recall atcer.pt is a function not of how many 
items arc left to be recalled but rather a function of how 
many chunks of information remain • 

A further set of predictions derived from Johnson's 
model concerns the relationships between items and between 
an item and the chunk code. Chunks are assumed to function 
like "opaque containers." If any item is changed, the chunk 
code v/ould longer apply. Results from several studies 
(Johnson, 19 70) confirm that changing a letter in a chunk 
results in large losses of information regarding both the 
changed and unchanged letters. As well, the degree of 
mutilation of a chunk seems to have no effect: changing one 
letter is as effective in disrupting recall as changing two 
or three letters. 

The last problem to be tackled by the model is that of 
storage of order information. The hierarchical 
organization of information imposes some restraint on the 
number of ordered sequences that could be recalled. Assume 
a seven-item .sequence (A3/CD/EFG) . Let the slashes signify 
the boundaries between chunks. If S is in the process of 
decoding the A3 chunk, it is impossible for him to recall G 
since G is not available for recall at this time. This 
severely limits the number of orders which can be recalled 
but an additional mechanism is necessary. Johnson (1972) 
assumes the existence of "order tags" associated with each 
item or code. The presence of order tags for A and B 
permits S to recall tlie items in this chunk in the correct 
order, tITat is, A precedes B. Johnson hypothesizes that 
each item may be tagged for order all the way up the 
organizational hierarchy; he presents data (Johnson, 1970) 
to support this viev;. 

Johnson's model is the most explicit current attempt to 
empirically test questions and predictions arising from a 
coding view of human memory. In the four experiments to 
follow some aspects of the model v;ill be tested in various 
experimental situations. Unlike the majority of Johnson's 
studies (which used letter sequences) our experiments will 
use meaningful English sentences where the chunk divisions 
will be assumed to be defined by the phrase structure of the 
sentences. 
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ExpGriment 1 

As a locjical outc;rowth of Miller's {1956a, 1956b) 
chunking proposal, it may be asked if recall of chunks is 
"all-or-nono'* (called the "initial reproducing tendency" by 
Muller and Pilzecker, 1900), Johnson (1969) has suggested 
that chunks function like "opaque containers," His study 
supports the conception of each elenent of a chunk being 
separately tied to the chunk code itself; the alternative 
view of strong intra-chunk associations among elements (with 
one cor.prehensive chunk code) was not -^oo^^ted. However, 
Johnson used letter trigrams rather .ivence elements. 

This ignores both preexisting associi among words and 

the constraints imposed by grammatica. "-ructure* If 
grammar functions in the ways we have ju.itimated, the 
relations among words in a sentence may affect the "all-or- 
none" process. As Johnson (1968b) himself has found, 
phrases and phrase-like strings tend to be recalled more 
often in an all-or-none fashion than random strings of 
words. 

Four main variables are expected to affect all-or-none 
recall of phrases: 

1, Sentence concreteness — Following Begg and Paivio*s 
(1969) reasoning, we might expect more all-or-none recall 
for the sentence as a whole for concrete sentences. 

2. Adjective deletion — If size of a chunk is a factor 
in all-or-none recall, deletion of one or more adjectives 
should facilitate all-or-none recall. 

3, Adverb phrase position — The closer the adverb 
phrase is to the verb, the more likely its recall with the 
verb should be. 

4. Association value — Elem.ents linked with high 
association value should be more likely to be recalled 
together than those with low value. 

There are several other clear predictions from 
Johnson's model. If Ss are in fact treating phrases as 
chunk-like units in the present study, there should be 
clear TEP spikes at phrase boundaries. If, as was found in 
Johnson (1969) , the stop TEP at the transition between the 
first and second chunks is only affected by the number of 
remaining chunks, there should be a clear effect for adverb 
phrase position. Finally, if the size of the first chunk is 
critical in determining number of omission errors, then the 
size of the first chunk should affect number of omission 
errors for the sentence. 

Method 

Subjects . A total of 179 students from the Introductory 
Psychology course at Dartmouth College served as subjects; 
thirty-six participated in the experimental sessions, and 
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the remaining 143 Ss helped in preparation of the response 
sentences. 

Design . A paired-associato paradigm v/as employed. The 
stimuli wore two-digit nurr^bers^; the responses were 
sentences. The experiment was run as a factorial design 
with two independent factors and tv/o repeated-measures 
factors. The independent factors were Concreteness 
(sentences were either concrete or abstract) and 
Association Value of the adjectives in the subject phrase 
(high or low association with the subject noun) ; the 
repeated measures were Nu-^^ber of Adjectives presented in 
the subject phrase (two^ , one^ , or zero) and Phrase 
Position (the adverbial phrase could occur at the beginning 
or end of the sentence , or be absent from the sentence 
entirely) . The design was thus a2x2x3x3 factorial 
design with repeated measures on the last two factors. 
Nine £s were tested in each of the four cells formed by the 
two independent factors. 

Materials > The sentences had the basic form Article ^ 
Adjective 1^, Adjective 2^, Subject Noun , Verb y Article y 
Object L>.oun y Preposition y Article , Phrase I^oun (e.g. , The 
weary old man openea the uiuprella near rhe park ) . Nine 
concrete sentences and nine abstract sentences were 
constructed using the following procedures. A pool of 200 
abstract sentences f having the general form described above y 
except for the adjectives y was generated by the 
experimenters. Four judges evaluated every sentence on the 
criteria of adequate abstractness of each word and the whole 
sentence y and meaningf ulness of the whole sentence when the 
adverb phrase was in each of the possible positions. Any 
sentence questioned by any one of the judges on these 
criteria was discarded; 65 sentences remained in the pool. 
These 65 sentences (which lacked adjectives in the subject 
phrase) were presented to 83 volunteer subjects from 
Introductory Psychology. These subjects were instructed to 
supply two adjectives that would "very commonly" appear 
with the subject noun in each sentence. These adjectives 
were then subjected to a frequency county and the two 
adjectives most frequently used with each sentence were 
selected as the high association (HA) adjectives. 

Since some attempt would be made to compare recall of 
various parts of a sentence (adjective y nouny verby etc.)y 
it was felt that those parts should be matched on Thorndike- 
Lorge frequency as closely as possible. To accomplish thisy 
the 65 sentences (including the HA adjectives) were sorted 
by a computer program which endeavoured to find the nine 
sentences with the greatest similarity of Thorndike-Lorge 
frequency among the words in the sentences. The program 
selected nine sentences at random from the pool of 65 
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sentences f computed the average Thorndike-Lorge frequency 
(by rank in the 54 words in temporary store) for the 
adjectives, the nounSf the verbs, etc., and then computed 
the variance of these means^ This process was reiterated 
100 tines, and tlie sample of nine sentences v;ith the lowest 
variance of ranked frequencies was chosen as the stimulus 
group. These nine sentences were then examined to be sure 
that no adjective was used in more than one sentence (one 
adjective had to be replaced by the next most frequently 
given adjective) • For each sentence tv/o adjectives that 
had a frequency of one and matched the two HA adjectives on 
Thorndike-Lorge value were selected as low association (LA) 
adjectives^ 

Once the nine abstract sentences were generated, these 
were used as the models for construction of nine concrete 
sentences matched word-f or-v/ord with the abstract sentences 
on Thorndike-Lorge values* A computer program generated 
from pools of stored verbs, nouns, and adverbs a sample of 
sentences which matched each of the abstract sentences for 
Thorndike-Lorge /alue within a range of ±5. The adjectives 
were not included in this generation. Nine concrete 
sentences which v;ere judged to be meaningful and realistic 
were chosen from the generated pool, and these were 
submitted to a sample of 60 Ss from Introductory Psychology 
who supplied "common" adjectives as was done for the 
abstract sentence pool above^ These adjectives were counted 
for frequency of use in the nine sentences, and the two in 
each sentence that were the most conimonly used and had the 
closest approximation to the abstract adjectives in 
Thorndike-Lorge value were selected as HA concrete 
adjectives^ The tv;o adjectives with the lowest usage which 
matched the corresponding abstract adjectives were 
designated LA concrete adjectives. The concrete and 
abstract sentences that were finally chosen are presented in 
Table 1-1* 



Insert Table 1-1 about here 



Procedure . Testing of Ss took place at a computer teletype 
connected to the Dartmouth Time-Sharing System; Ss were 
prevented from seeing what was being printed by a small 
screen taped to the teletype. When an entire line had been 
printed, this was advanced into view so that S could see it 
all at once. After Ss were seated at the teletype and had 
read the instructions, they were given a presentation trial 
consisting of nine number-sentence pairs with a 4-sec inter- 
pair interval. After a 15-sec pause, the first test trial 
began. The teletype printed one of the two-digit numbers, 
and S was to input the associated sentence. After S had 
input an answer (or "DON'T KNOW") , the computer asked S to 
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Table 1-1. Concrete and abstract sentences used in the 

study 

(low association adjectives are in parentheses) 
Concrete 

The faithful^ Indian (bold^raountain) scout led the cattle 
through the gap. 

The concerned^ loving (protective ^helpful) parent unpacked 
the cheese after the almonds. 

The rich^ greedy (fat^aged) king polished the diamond for 
the men. 

The small^ sharp (single^ thick) thorn jabbed the minister in 
the wrist. 

The frightened^ little (helpless, lost) child halted the colt 
by the bridge. 

The tall, old (strong, young) sentry guarded the wharf near 
the rocket. 

The alert, impartial (deliberate , exacting) referee 
identified the winner after the race. 

The proud, diligent (beaming , friendly) farmer wiped the 
tractor before the market. 

The daring, radical (indignant , alert ) student protected the 
beggar from the porter. 

Abstract 

The careful, thorough (severe, second) review missed the 
error despite the effort. 

The stiff, pompous (somber, drab) formality characterized the 
occasion at the outset. 

The recent, bad (dark , tragic) experience obscured the issue 
before the exploration. 

The bitter, suppressed (pure, wild) hate ended the harmony 
during the estrangement. 

The revolutionary, political (strong , conservative) movement 
derived the power from the incident. 

The repeated, futile (wasted, tiring) attempts caused the 
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frustration for a v;hile. 

The radical, now {rhotorical , political) ideology won the 
support after the reform. 

The sound, clear (strict , concise) logic outdid the intuition 
in the session. 

The stupid, careless (needless , incredible) blunders plagued 
the theory throughout the debate. 
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verify thvit this \;as indeed the answer he wished to enter. 
After S verified his input, the correct answer was 
presei/t'jd. After a 4-sec pause, another two-digit number 
was picGonted, and ? nade another response. This testing 
procedure contxnued^until all nine sentences had been tested 
for recall. 

Ss were given six presentation and six test trials on 
the group of nine sentences. The order of presentation of 
sentences was randonized on each trial. Follov/ing these six 
trials, each corupleted a questionnaire to assess mnemonic 
strategies. 

Results and Discussion 



Whole Sentence Analyses 

A four-way analysis of variance design utilizing the 
factors of Concrobeness (concrete and abstract sentences) , 
Association Value of the adjectives (high or low), Number of 
Adjectives in tne subject phrase (two, one, or none) , and 
Phrase Position (end of sentence, beginning of sentence, or 
absent) was used for a nuirJ^er of initial analyses. The 
first two factors were based on independent groups while the 
second tv;o factors were repeated measures. Tne dependent 
measures analyzed in this design included a) number of times 
the sentence v;as entirely correct, b) proportion of words in 
the sentence recalled correctly, c) average transition 
error probabilities (backward and forward TEPs) in the 
sentence of three kinds — deletion, commission, and stop 
(Johnson, 19 70), d) proportion of errors of each of three 
kinds — synonym errors # omission errors, and cor .aission 
errors (intrusion errors) . 

Number of times sentence entirely correct . Concrete 
sentences were recalled completely correctly more often on 
the average than abstract sentences (2.9 vs 1.63, F = 12.66, 
df = 1/32, < .01). As well, a significant effect (F = 
TT. 55, df = 2/64, ^ < .01) was obtained for Number of 
Adjectives ; two-ad]ective sentences (X = 1.7) were recalled 
completely correct less frequently than either one-adjective 
sentences (X = 2.45) or zero-adjective sentences (X = 2.62) • 
There was no significant difference between one-adjective 
and zero-adjective sentences. The most parsimonious 
explanation of this result is that the two-adjective case 
simply presents more items for S to remember and hence 
poorer recall. A similar interpretation may be applied to a 
significant result (F = 24.73, df = 2/64, £ < .01) for 
Phrase Position . V/hen the adverB" phrase is absent (X = 
2.94) , recall of the entire sentence is better than when 
the phrase is at the first of the sentence (X = 2.0) or at 
' the end (X = 1.85) . 

There is also a significant interaction (F = 3.51, df = 
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2/64^ p < .05) betv;oen Concr ete ness and Phrase Position ; the 
means are presented in Table 1-2 and Figure 1-1. 



Insert Table 1-2 and Fig* 1-1 about here 



Proportion of words recalled correctly. The results for 
this measure ""paraileT closely those obtained for the 
previous measure. Concrete sentences had an average 
proportion of .67 words recalled correctly while abstract 
sentences had a mean proportion of .45 (F = 19.3, df = 1/32, 
£ < .01). A lower pjroportion of words was recalled for the 
two-adjective case (X = .50) than for either the one- 
adjective (X = .6) or zero-adjective (X = o59) instances (F 
= 7.10, df = 2/64, £ < .01). A similar result obtains for"* 
Phrase Position ; proportional recall when the phrase is 
absent J^X = .6 4) is better than when the adverbial phrase is 
first (X = .51) or last (X* = .52) in the sentence (F = 
18.85, df = 2/64, £ < .01). 

As well, there is a significant interaction (F = 6.29, 
df = 2/64, p < oOl) between Concreteness and Phrase 
Position. The mean proportions for this interaction are 
presented in Table 1-3, and a graphical presentation is in 
Fig. 1-2. 



Insert Table 1-3 and Fig. 1-2 about here 



Forward de let io n TEPs . There was a significant main effect 
for Nu friber of Adjective s (F = 10.63, df = 2/64, £ < .01). 
The zc.ro-adjective ca^e nad a significantly lower mean 
forward deletion TEP (.011) than either the one-adjective 
condition (.020) or the two-ad jective condition (.037). 

The mean TEPs for the significant interaction between 
Phrase Position and Concretenes s (F = 3.53, df = 2/6 4/ p < 
.05) are presented in Table 1-4 and Figure 1-3. 



Insert Table 1-4 and Fig. 1-3 about here 



Backward deletion T EPs . The only significant effect was 
that of"' M umber of Adjectiv es (F = 6.88, df - 2/64, £ < .01). 
The zero-ad^ective conditTon had a signiFTcantly lower mean 
TEP (.010) than either the one- adjective case (.026) or the 
two-adjective case (,032). One possible explanation for 
this result ir that Ss were learning some parts of the 
sentence (e.g., the subject phrase) in a right-to- left 
order. Thus, Ss would learn the subject noun first and the 
adjectives later. When there were no adjectives to be 
learned (the zero-adjective case) , the mean backward 
deletion TEP for the sentence was lowered. Test of this 
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Table 1-2. Interaction of Phra» e P osition x Conoreteness — 
Number of sentencss "i'nP.rely correct 

Sentence Type 

Concrete Abstract 

Beginning 2.61 1.39 



Phrase 
PositTon 



End 



2.72 



.98 



Absent 



3.37 



2.52 
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Table 1-3. Mean proportion words recalled correctly — 
Interaction of Phrase Position x Concreteness 

Sentence Type 

Concrete Abstract 

Beginning .62 .42 

Phrase 
Position 

End .68 .37 



Absent o72 .57 
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Table 1-4. Interaction of Phrase Position x Con ere teness — 

Mean forward deletion TEPs 

Sentence Type 

Concrete Abstract 



Phrase 
Position 



Beginning .013 .022 

End .017 .042 

Absent .026 .016 
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hypothesis must await review of the specific transition 
error patterns in a later section* 

Forv;ard coi umlssion TCPs* There was a significant 
effect for '":iur.\oor of Adjective s (F = 5.07, df = 2/64, £ < 
• 01)» The t\;o-adjcctive case had the highest mean TEP 
(•087) follov;ed by the zero-adjective case (.071), then the 
one-adjective case (o056)# 

Backward conunission TEPs. Only one significant result was 
obtained — an effect for Number of Adjectives (F = 4.12, df = 
2/64, p < .05). The ordering was the same as for forward 
commission TEPs : the two-adjective case had a mean TEP of 
.066, follov/ed by .0 53 for the zero-adjective case, and .041 
for the one-adjective case. 

Forward stop TEPs . Concrete sentences had a significantly 
lower mean TEP (.005) than abstract sentences (.035) (F = 
28.89, df = 1/32, g» < .01). The difference between these 
two means is especially striking if we convert these 
probabilities to mean frequencies. Stop errors occurred 
about once every three Ss for concrete sentences while each 

learning abstract sentences made about two stop errors 
over his six trials. 

Backward stoo TEPs. The only significant result was for 
Phrase Position Tf = 4.68, df^ = 2/64, £ < .05). When the 
phrase was at the beginning, there was a significantly 
higher mean TEP (.017) than when the adverb phrase was 
cibsent or at the end of the sentence (both means were .006). 
One possible explanation is that Ss found the learning of 
sentences with adverb phrase at the beginning to be 
difficult because the sentence sounded stilted; they may 
have been recalling the sentence with the adverb phrase 
moved to the more natural position at the end of the 
sentence. This would result in higher backward stop TEPs 
for the condition when the phrase was presented at the 
beginning of the sentence. However, given that the adverb 
phrase (or part of it) was recalled at all, less than one 
half of one percent of the time did Ss recall it out of 
position. 

Synonym errors ^ These are errors that were judged to 
involve replacement of a correct word by a synonymous one 
(e.g., replacing "diamond" by jewel" in recall). Abstract 
sentences had a significantly higher proportion of words 
replaced by synonyms (.028) than concrete sentences (.018) 
(F = 7.55, df = 1/32, < .01). 

Omission errors . A significant effect for Number of 
Adjectives (F= 6.56, df = 2/64, £ < .01) showed that there 
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were significantly fewer omission errors for both the zero- 
adjective (.36) and one-adjective cases (.35) than for the 
two~adjective case (.43). Adverb Phrase Position also had 
an effect (F = 12.69, df = 2/64, £ < .01). When the phrase 
was absent, there were significantly fewer omission errors 
than when the phrase was at the beginning or end of the 
sentence (.30 vs .41 and .42) o A significant effect for 
Concrote ness (F = 28.06, df = 1/32^ £ < .01) revealed a 
large difference between concrete sentences (.26) and 
abstract sentences (.55). 

As well, there was an interaction (F = 5.01, df = 2/64, 
< .01) between C oncreteness and P hrase Position. The mean 
proportions are presented in Table 1-5 and Figure 1-4. 



Insert Table 1-5 and Figure 1-4 about here 



Subject Phrase Analyses- - Word Frames 

Due to the importance attached to the variables 
manipulated in the subject phrase (association value of 
adjectives and number of adjectives), a three-factor 
analysis of variance design ( Concreteness by Association 
Value by Number of Adjectives ) was used to analyze 
individually each of the first 5 word frames on the 
following dependent measures: a) proportion of words 
correct, b) number of synonym errors, c) ntmiber of omission 
errors, d) number of commission errors. 

In the analysis of errors in the individual word 
frames, the factor Number of Adjectives was treated as a 
two-level factor in the analyses of adjective errors because 
no adjectives occurred in the zero-adjective condition. As 
v;ell, since there were no significant differences on 
proportion correct between adjective one and adjective two 
in the two- adjective condition, a mean value for the two was 
computed and used in the analyses. 

Because of the large number of significant effects 
obtained, only the theoretically important ones will be 
reported. Most of the effects pertaining to "The" will be 
omittedo 

Recall of the subject phrase noun . Two variables exhibited 
strong main effects in this analysis (and in the ones to 
follow). The concreteness of the sentence affected both the 
proportion of nouns recalled correctly (F = 10.82, df = 
1/32, p < .01) and the number of omission errors (F = 17.25, 
df = 1/32, £ < .01). Concrete nouns were recalled correctly 
67% of the time while abstract nouns were recalled 48% of 
the time. As well, concrete nouns were omitted 24% of the 
time while abstract nouns were omitted 44% of the time. 

The number of adjectives also affected both recall of 
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Table 1-5. Interaction of Concrete ness x Phrase Position — 
Mean proportion of words recalled as omission errors 

Sentence Type 

Concrete Abstract 

Beginning .14 .41 

Phrase 
Position 

End ol3 .42 

Absent .11 .30 
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the nouns (F =8.58, df = 2/64 , £ < .01) and the number of 
omission errors (F = 11.09, df = 2/64, £ < .01). The noun 
was rocaJlod loss often in the tv;o-adjective case (50%) than 
in either thn zero-adjective or one-adjective cases (59% and 
63%, respectively). VHien there were omission errors, they 
were more likely to occur in the two--adjective case (41%) 
than in the one-- adjective (3%) or zero *ad jective cases 
(28%). 

Recall of the verb o Almost the identical pattern occurs 
with verb recall as does v;ith the subject noun recall. 
Concret eness and Number of Adjectives yield the only 
significant effects , and they do so for proportion correct 
and number of omission errors. Concrete verbs were recalled 
correctly 6 3% of the time while abstract verbs only 40% of 
the time (F = 15.77, d£ = 1/32, £ < .01). Concrete v^rbs 
were omitted 25% of the time while abstract verbs suffered 
this fate 50% of the time (F = 25.98, df = 1/32, £ < .01). 

In the two-adjective case, verbs were less frequently 
recalled (44%) than in either the one-adjective (56%) or 
zero-adjective (54%) cases (F = 9.83, df = 2/64, £ < .01). 
As well, when two adjectives were present, there were more 
verb omission errors (4.4%) than when one (3.5%) or no 
adjectives (3.4%) were present (F = 10.22, df = 2/G4 p < 
.01). ^ 

Recall of a djectives . When two adjectives were present, the 
average recall frequency was 50% .vhile the one-adjective 
case yielded 59% recall (t = -2.31, df = 35, £ < .05, two- 
tailed). 



Between-Phrase — Within Phrase Transitional Error Probability 
An alyses 

To examine what effect each variable had on 
unitization within the subject phrase, the mean forward TEP 
(deletion, commission, and stop) within the subject phrase 
was compared to the TEP for Lhe subject noun-verb 
transition. Here the noun-verb transition is being treated 
as a reference datum. This analysis design is the same as 
the three-factor design used for the analysis of the word 
frames above, with the addition of Transition Type (wi thin- 
phrase and between-phrase) as a repeated-measures factor. 

Because the important question involves differential 
unitization between or within the phrases, only those 
significant results involving Transition Type will be 
reported. 

Forward Stop TEPs . There was a significant effect for 
Transi tion Type (F = 34.57, df = 1/82, £ < .01). The mean 
wi thin-phrase TEP was .02 whTFe it was .15 for the between- 
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phrase transition. The high TEP level for the between- 
phra.so transition is important support for our contention 
that Ss wore learning the phrases as relatively isolated 
units (chunks). There was also a significant interaction 
involving Concrotoness (F = 21.97^ df = 1/32^ £ < .01). 
Table 1-6 presents the relevant TEPs. 



Insert Table 1-6 about here 



As may be seen from Table 1-6^ the significant 
interaction comes from the very high value for abstract 
betwcen-phrase treuisitions . Given that the noun is 
recalled^ there is a 27% probability that S will terminate 
his recall attempt at that point. The Ss are obviously 
treating the subject-predicate boundary as a major 
psychological boundary in their recall of abstract 
sentences . 

Forward deletion TEP s. Again, there is a significant main 
effect for Transition Type (F = 6.61, df^ = 1/32, p < .05). 
Now the mean TEP is .05 for between and .11 for within the 
phrase. ^ This result suggests that when Ss make a deletion 
error, it will much more commonly involve a word within the 
subject phrase than it will involve the verb. The locus of 
this effect is seen clearly in the significant interaction 

Transition Tvp£ and Number of Adjectives (F = 3.56, df = 
2/64, p < .05) .""Table 1-7 and Fig. 1-5 show that the vaTue 
of the mean forward deletion TEPs is about constant for both 
the two-adjective case and the one-adjective case and 
plummets to zero with the zero-adjective situation. 



Insert Table 1-7 and Fig. 1-5 about here 



For the within-phrase situation, it is probable that 
what is being deleted is/are the adjective (s) . When 
adjectives are present, they also exert an influence beyond 
the subject phrase; they affect the probability of deletion 
errors of the verb also. This effect is not predicted from 
Johnson •s decoding-operation model; there is nothing in the 
model to suggest how characteristics of one chunk could 
affect recall of items in another chunk. Chunks may not be 
"opaque containers" after all. Another finding appropriate 
to mention at this time is a significant effect for Phras e 
Position on an analysis of the number of times the sentence 
was recalled completely wrong. Johnson's model predicts 
more omission errors for the sentence as a whole the longer 
the first chunk is. If the adverbial phrase is first in the 
sentence, the first chunk contains 3 items; if the phrase is 
either last or omitted, the first chunk still (on the 
average) contains 3 items (the subject phrase). Thus, 
there should be no clear differences because the lengths of 
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Table 1-6. Mean forward stop TEPs for interaction of 
Concreteness by Transition Type 

TransitioT\ Type 

Within Between 



Concrete 



Abstract 



.00 
.X)4 



.03 
.27 
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Table 1-7. Mean forward deletion TEPs for interaction of 
Transition Type and Number of Adjectives 

Number of Adjectives 

2 10 

' Within .174 .152 0 



Transition 



Between .075 .06 .015 
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the first chunk arc con r, tan t, 

Howovor^ diffcronccs do occur. \\Ticn the phrase is 
firsts there are an average of 2»3 cases whe,re the sentence 
is recalled entirely incorrectly* When the phrase is last^ 
this figure in 1,7 and 1*4 when the phrase is omitted. This 
difference cannot be explained on sequence length either 
since the sentence lengths in the three cases above (in 
order) are: 9^ 9^ 6, 

Obviously^ something other than the size of the first 
chunk is affecting the number of times S has completely lost 
the sentence from memory. Probably^ the disruption in 
familiar order of the sentence phrase elements is a great 
hindrance both to Ss storage and retrieval of the sentence • 
This is not to deny the validity of Johnson's model; the 
effect simply shows that there are other factors operating 
which can overpower the effect of the length of the first 
chunk on omission errors. Another prediction from Johnson's 
theory is that the number of chunks remaining to be recalled 
exerts a strong influence on stop TEPs at a chunk 
transition. However ^ no significant differences at the 
noun-verb transition were found as a function of number of 
remaining chunks. 

All -or- None Learn ing 

The last major analysis utilized the factors of 
Concreteness , Association V alue ^ and Number of Adjectives to 
examine what effect the mani pulations had on all-or-none 
recall of sentences. The dependent measure used for this 
analysis was a modified Z' clustering statistic (Flagg & 
Reynolds^ 1972). Briefly, the clustering statistic is 
calculated as follows. For each recall attempt by each S a 
measure (0) is computed which describes the number of paFrs 
of correctly recalled words in the recall string. For each 
possible string length, an estimate based on random 
placement of words during recall, is also computed. The 
difference between 0 and M divided by the predicted variance 
(V) gives an S-statistic which can be converted to a 
standard score (Z). To solve the problem of unequal ranges 
and variances with different string lengths, this score is 
divided by a constant to yield a Z' statistic (with a range 
of +1 to -1). The more positive this figure is, the more 
correctly ordered or clustered the recalled string is. If 
we omit from our analysis those cases where no recall 
attempt was made, the z' statistic becomes a measure of all- 
or-none learning. An independence model of learning 
predicts essentially a random placement of items in a recall 
string. The more that items are recalled in the veridical 
order, the less an independence model holds. Of course, 
neither an independence nor all-or-none model has any 
specific predictions when recall is zero/ hence this is 
omitted from our analyses. 
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None of the m^tnipulated factors resulted in 
significant main effects. This is very discouraging given 
tlio strong theoretical grounds on which these factors were 
chosen. 

The jiiaiii findinns of this study in relation to N. F. 
Johnson's dccodinq-oporation ir.odel and previous research on 
sentence learning cc\n bo sumn^arized as follows. 

The finding of clear stop TEP spikes at major phrase 
boundaries verifies the argument that Ss treat phrases as 
chunks of inforsiiation and that surface-structure granunatical 
form can function effectively as a segmentation procedure 
for the study of chunking of information. 

The size of the initial chunk of information does not 
seem to be the main determinant of omission errors. Rather , 
the more disruptive factor is a manipulation involving 
placement of a Phrase, normally found near the verb, at the 
beginning of thv. ^entence. Although grammatically 
legitimate and serantically intelligible, the resulting 
sentence is very difficult for S to learn, store, or 
retrieve. As well, it was found that the number of chunks 
remaining to be recalled did not affect stop TEPs at the 
noun-verb transition. 

The maiii questions which generated this study remain 
unanswered — those related to the variables which affect all- 
or-none learning. No significant differences were obtained 
with any of the variables employed. It is possible that the 
dependent measure employed (Z* statistic) is either unsuited 
or insensitive in this situation. Attempts will be made in 
the future to find more promising measures. 

The instances where clear significant effects were 
found are numerous nonetheless. The concreteness of the 
stimulus sentences almost never failed to be significant on 
any of our several measures of recall performance. This is 
in line with the reported potency of this variable (e.g., 
Paivio, 1971) under many experimental paradigms. It is 
clearly seen in the present study that concreteness also has 
large effects in paired-associate sentence learning. In the 
studies to follow this potency will again be demonstrated 
many times; thus f a discussion of the possible theoretical 
reasons for this will be postponed until the end of the 
report. 

Given the encouraging findings regarding phrase 
structure as a chunking/segmentation cue, it was felt that a 
developmental study was in order to attempt to assess the 
beginnings of this strategy in childhood. The next 
experiment explores this and several related questions. 
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Ex porimon t 2 



This study was dosic:ncd to explore soinc of the possible 
develop:r.ontal processes v;hich would lead to the types of 
chunkinq porforr.ancc uncovered in Kxp. 1* The literature in 
this area is scant. Only one study (Matthcv/s^ 1965) has 
explored the devo lopraental trends in chunking as treated by 
Johnson 'i; model. This study (an unpublished M. A. thesis) 
exandned the pattiirns of sentence TIjPs of children in grades 
one through four urging a paired-associate task. The main 
dependent measure was th^ correlation between obtained TEP 
pattern and the predicted pattern based on Johnson •s 
decoding-operaticn model. The mean correlation showed an 
abrupt jump between seven and eight years of age. This 
result is consistent with previous word association data 
suggesting a shift from syntagmatic to paradigmatic response 
styles at this period. The results can be interpreted in 
tv;o (not necessarily independent) ways. First, the pattern 
may represent an increase in the children* s functional 
knowledge of parts of speech. It may also indicate a change 
in the functional knowledge of grammar. 

In an attempt to explore the consistency of this 
pattern we decided to ask children to perform a sentence- 
learning task (as Matthews, 1965, did) and also to take a 
standardized word-association test. We hoped that 
administration of both measures on the same children would 
clear up some of the conceptual haziness deriving from 
Matthev.'5.* use of word association n orms only. 

Initially, we attempted a paired-associaLe procedure as 
was done by Matthews but soon uncovered a surprising 
finding. Although the objectively-scored measures of word 
recall agreed well with the findings reported by Matthews, 
it was clear that the results were in some sense spurious. 
Take, for example, the performance of children in the first 
grade. Over five trials of paired-associate learning with 
six number-sentence pairs per trial, children were showing 
little evidence of correct recall until the third or fourth 
trials. This was not due, however, to lack of recall or 
faulty recall within a sentence but rather to many instances 
of complete (correct) recall of a sentence to the wrong 
number cue. We felt that TEPs based on correct recall to 
inappropriate cues would possibly be misleading. It seemed 
evident that even the very young children were learning the 
sentences fairly quickly but were simply failing at the 
number-sentence linkage. For this reason we modified our 
learning task to that of a probe-recall operation in the 
hopes that it would give a purer measure of the integration 
and chunking strategies used by the children. 
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Me thod 

Subjects, Tv/onty students from each of grades 1-4 from the 
Wilder (Vt.) :"3oraentary School and tv/enty volunteers from 
Introductory Psychology at Dartmouth College served as S s . 

General design ^ The task was a modified probe- recall 
proccdure"v/here S was presented with a block of six 
sentences for a presentation trial and then given one of the 
phrases of one of the sentences as a cue for recall of the 
sentence. All six sentences were tested for recall on each 
trial; there were five trials with order of administration 
of both the cue phrase and the sentences varying from trial 
to trial. After S had responded on each presentation, the 
full correct sentence was read to him. There were two types 
of sentence materials (concrete and abstract) and three 
possible cues (subject phrase, modifier phrase, or verb 
phrase). The general design thus employed the factors of 
Grade (one through four plus adults) , Concre teness (concrete 
or abstract sentences) , and Probe (three possible Phrase 
cues). The first two factors are independent; ten Ss were 
tested in each cell of the 5x2 design. The last factor is 
a repeated-measures variable. 

Materials . All sentences used were of the general form " The 
wall around the grey c astl e was high . " The three possible 
cue phrases are: 1) The wall , 2) around the grey castle , 
and 3) was high . Six abstract and six concrete sentences of 
the above form v;ere generated by E from a pool of AA count 
words (Thorndike & Lorge, 1944). These sentences were 
independently evaluated by two colleagues for ease of 
understanding, meaningf ulness , and compareibility of 
difficulty. In order to make the sentences as 
representative as possible, a second form of each sentence 
was constructed where the only change was the substitution 
of another AA adjective in the modifier phrase. These two 
parallel forms of the sentences were administered to equal- 
sized groups of S s ; since there is no theoretical issue at 
stake, no analyses were performed on the alternate 
versions, and hereafter this manipulation will be ignored. 

The word-association stimuli used were the first 
twenty-five stimulus words of the Kent-Rosanof f list with 
the exception of the word " mutton " which many of the 
younger children did not understand. 

Procedure . Testing of the elementary school children was 
individual ; all sentences were presented orally by a young 
female experimenter with good rapport with the children. 
The sessions were tape-recorded and later transcribed. 

After instructions using a practice sample, E read each 
of six sentences twice; S was asked to repeat each sentence. 
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Trial 1 began v;ith E giving S a probe phrase. The probe was 
presented as a "clue" or a "part of the sontencer" and S was 
asKed to remonbor or "guess" the whole sentence. After a 
response v.-as c/ivon by l£ corrected any mistakes by reading 
the fi'll correct sentence and asking S to repeat the correct 
answer. If S had responded correctly^ E gave verbal 
approval cmd repeated the correct answer. This continued 
for five trials v/ith the order of presentation of sentences 
varying from trial to trial but consistent for all Ss. 
Children received either con^-rete or abstract sentences. 
After the sentence- recall part of the study ^ children were 
tested on a 24-iteru word-association test with standard 
instructions ("Tell me the first word that you think of ^ 
etc."). At the conclusion of all testing each child 
received a small reward (toy^ charm, etc.). 

The procedure was essentially the same for the twenty 
adult Ss with two exceptions: Ss were tested in two groups 
of 10 each, and responding was written, not oral. The same 
E that tested the children administered the task to the 
adults . 

Results and Discussion 

Preliminary analysis of the data indicated that the 
five groups (grades 1-4 and adults) differed strikingly in 
the extent to which they used the cue phrase in their recall 
attempts (F = 14.84, df = 4/90, £ < .001). While adults 
almost always responded v;ith the cue phrase (99%) even when 
this was all they could answer, children in grade 1 used 
this strategy only 78% of the time. Thus, it was decided to 
emit the cue words from all subsequent analyses. 

Proportion of v;ords recalled correctly 

There was a striking main effect""for Concreteness (F = 
162.25 , df = 1/90, p < .001). For concrete sentences recall 
was 87% while it was only 58% for abstract sentences. 

There was also a main effect for Probe (F = 10.56/ df = 
2/180, £ < .001). The subject phrase (74%) and the modifTer 
phrase (73%) functioned as better cues than the verb phrase 
(70%). This result may be partly specific to the sentences 
used in the present study since the verb phrases were more 
general in meaning and could meaningfully be used in more 
than one sentence. There was a significant etfect (F = 
14.47, df = 4/90, p < .001) for Grade . This effect Is 
overshadowed by the signifi::ant interaction with 
Concreteness (F = 10.40, df = 4/90, £ < .001). This finding 
is presented in Table 2-1 and Fig. 2-1. 



Insert Table 2-1 and Fig. 2-1 about here 



As may be seen from Fig. 2-1, there is almost no difference 
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Grade 



Mean proportion of words recalled 
of Grade by Concreteness 


correctly for 




Concrete 


Abstract 


1 


.85 


.34 


2 


.85 


.52 


3 


.92 


.54 


4 


.86 


.64 


Adults 


.90 


.84 
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on recall of the words in the concrete senlonccs. Although 
recall is not perfect^ it is high for all age groups. 
There is no superio2*ity for adults over children. As a 
matter of fact, porforraance by the adults is actually 
inferior to that of children in grade 3. For abstract 
sentences there is^ steady improvement in recall v;ith age. 

These results raise an important point. It has long 
been a central feature of many developmental theories of 
cognition that the use of ikonic imagery is a prime 
representational technique employed by children. It is only 
with maturation that this ability becomes overshadowed or 
replaced by the use of verbal memory. The present finding 
demonstrates clearly that given the appropriate materials 
and experimental technique, the use of imagery is probably 
not lost by adults and in fact is present as an effective 
mnemonic mechanism in very young children. It is a moot 
point as to how far back on the chronological scale we would 
have to proceed in our experimental design before we found a 
decline in performance with concrete materials. 

Paraphrase Sco ring 

To explore the possibility that the results obtained 
above for verbatim recall are in fact representative of the 
memory storage abilities of the children, a further analysis 
was performed. It may be that the type of mental 
representation employed by the children is essentially 
different from that of adults, but through the use of 
verbatim scoring it appears similar for concret'. materials. 
That is, the adults may greatly surpass the children's 
performance when we use a less restrictive measure of 
recall. Hence, we decided to score the sentence recalls not 
for verbatim errors but rather for failure to recall the 
essence or core of the presented sentence. Two judges, 
naive as to the purpose of the experiment, independently 
scored all recall attempts. They disagreed less than 1% of 
the time as to what constituted a failure to recall the 
essence of the sentence. These paraphrase errors were then 
analyzed in the type of analysis of variance design used 
above , 

The results almost completely match those for verbatim 
scoring of number of words recalled correctly. The most 
interesting result is the significant interaction of 
Concretenoss by Grade (F = 7,34, df = 4/90, £ < -001), Here 
the pattern is identical to that observed above. For 
concrete sentences grades 1, 2, and 4 are essentially equal 
in performance, adults and grade 3 are the best. For 
abstract sentences there is a steady increase in performance 
from grade 1 to adults. 

This result verifies that the finding for number of 
words recalled correctly was not artif actual. 



37 



ERLC 



45 



C lust ering 

Uot only arc we interested in the number of words the 
children recall but also whether they are recalled in their 
appropriate placos. The clustering measure (Z*) described 
in Experiment 1 v;as eipploycd as a dependent measure here. 
Besides a significant main effect for Grade {F = 15.09, df = 
4/90, p < .001)^ there was also a significant interactiorT" 
between this factor and Concreteness (F = 5.94, df = 4/90^ p < 
.001). — ' ' ^ 

The results are identical to those obtained above: 
for concrete sentences grades 1, 2, and 4 are surpassed by 
grade 3 and the adults. For abstract sentences there is a 
steady increase in clustering from grade 1 to the adults. 

Word -Association A nalyse s 

The 24 word-association responsev from each child were 
scored on a number of categories. These included the 
standard categories of paradigmatic, syntagmatic, klang, and 
blanks. As well, a fifth scoring category v;as devisad: 
every response a child made that exactly matched the primary 
adult response for that stimulus was scored as "adult-like." 
Analyses of variance were performed on all these response 
measures . 

Tv;o of the dependent measures are of special 
theoretical importance to us: adult-like and paradigmatic. 
It is widely accepted that the shift to paradigmatic 
responding is characteristic of maturation processes. Both 
these measures then may be assumed to be indicators of the 
degree to which the children perceive form-class relations 
as adults do on a word-association test. 

The analyses for both adult-like and paradigmatic 
responses yielded significant effects for Gr ade (F = 30c99, 
df = 3/76, £ < .001 for adult-like and F = 46.63, df = 3/76, 
p < .001 for paradigmatic). It had been hoped that the 
pattern of means would show some resemblance to the sentence 
recall patterns displayed by the children. However, there 
was no clear indication that the grade 3 children displayed 
any differential form-class knowledge. For adult-like 
responses grade 1 had a mean of 3.5, grade 2 had 9.1, grade 
3 had 8.8 while grade 4 had 11.8. The same ordering occurs 
for the paradigmatic category. It thus appears that the 
pattern of recall displayed by the children on the sentence 
probe task cannot be explained on the basis of differential 
form-class knowledge. 

More likely, the effects noted earlier can be traced to 
the growth of the memory representational systems rather 
than to retrieval strategies at test time. The data 
strongly suggest that the children differ from the adults 
not in the degree to which they unitize and recede 
sentential information but rather in the manner in which 
such information is stored. To substantiate this idea. 
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correlations between each child's average TEP pattern on the 
sentences wore correlated with tlie average adult TEP pattern 
for the same types of sentences. There v;ere no significant 
di a Terences across grados for the average degree of 
similarity to tl^e adults' pattern of recall* However, as we 
saw earlier, there were consistent and striking differences 
across ago levels for number of words recalled regardless of 
the method of scoring. These results reinforce our 
contention that the maturational sequence observed in the 
children undor study reflects not a different means of 
chunking and organizing sentential information but rather 
the growth of a second storage system for the storage of 
abstract information. 



Exper iment 3 

It would appear from previous research (Begg and 
Paivio, 19G9; Tulvin.; and Pal:kau, 1962) that the size and 
number of cliunks of .information is determined primarily by 
the nature of the stimulus materials. The specific 
qualities of the stimuli v;hich affect chunk formation are 
still vague. As well, the conditions necessary for the 
formation of hierarchies of chunks ("super-chunks") are 
suill largely unknov;n. One important factor which does seem 
to reliably affect size and numh r of chunks forme'^ is 
memory load imposed on the Ss . The assumption thac the 
maximum number of chunks that may be stored is a fixed value 
(7+2 for Miller, 1956a; 5 for Simon, 1970) leads the way to 
this experiment. 

To the extent that forrralr.ion of super-chunks is 
dependent on the memory load imposed on S, embedded clauses 
should be more likely to be subunits of verb chunks as 
memory load is increased. V7ith low memory load conditions ^ 
embedded clauses could stand as chunks on their own. It is 
also expected that concreteness of the sentences should 
facilitate formation of super-chunks and also result in 
confusion of placement of auxiliary clauses if Ss do in fact 
unitize concrete sentences more readily than they do 
abstract ones (memory for abstract sentences is assumed to 
be tied more closely to the left-to-right order of events in 
the sentence) . 

Method 

Subjects . A total of 89 students from the Introductory 
Psychology course at Dartmouth College served as Ss; 39 of 
these helped in preparation of the stimulus sentences, and 
50 Ss were tested in the experiment proper. 

Design. The experiment was a two- component memory task (cf. 
Savin & Perchonock, 1965); Ss were presented with digit 
strings followed by sentences and were instructed to recall 
both the digits and the sentences. It was emphasized in the 
instructions that it was more important to remember the 
digits but to try to recall both if possible. 

The experiment was run as a factorial design with two 
independent factors and one repeated measures factor. The 
two independent factors were C oncre teness of the sentences 
(concrete and abstract sentences were used) and Digit Length 
(the digit strings varied in length from one to five 
numbers) . The repeated measures factor was Cl ause Position 
(the subordinate clause could occur in the middle of the 
sentence or at the end of the sentence) . 

The design was thus a 2 x 5 x 2 factorial design with 
one repeated measure; five Ss were tested in each of the 
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ten colls formed by the two indcpondent factors. 

^Ivl* -"^ontencos all were of the form: The 

n£t i^iK?! eU"^ct::,>n/ ^^inc; tho £c»^-2,^ll.t^^ v;orc cK^fir, indicated 
socuriLv for^i^' rountrv. The 40 concrete and 40 abstract 
sentencVs used'^in tiie study were generated in the following 
manner. A group of 39 students from Int )ductory Psychology 
created a corpus of abstr:ict: sentences of the above form. 
This pool was then evaluated by the experimenters; any 
sentence which did not fit the sentence form, did not make 
sense, or was questioned on abstractness , was discarded. Of 
the 60 sentences which remained, 40 were chosen randomly to 
be used as stimulus sentences • The Thorndike-Lorge 
frequencies for the adjectives, nouns, and verbs in these 40 
abstract sentences were compiled, and 40 concrete sentences 
matched v;ord-for-v;ord on Thorndike-Lorge value within a 
range of ±5 were generated. All concrete sentences were 
independently approved by three judges on the criteria of 
concreteness and moaningf ulness . Two versions of each 
concrete and abstract sentence were constructed by simp.ly 
placing the subordinate clause in the middle or at the end 
of the sentence. 

Proced u re . Testing of Ss took place at a computer teletype 
connected to the Dartmouth Time-Sharing System; Ss were 
prevented from seeing what was being printed by a small 
screen taped to the teletype. When an entire line had been 
printed, this was advanced into view so that S could see it 
all at once. After S was seated at the teletype and had 
read the instructions, he was presented with 40 learning 
trials involving the digit strings and sentences. 

On each trial, a digit string was in view for 5 sec, 
followed by a sentence for 10 sec. After this study period, 
S was presented with a letter of the alphabet and was 
instructed to recite the alphabet backwards from that point 
(for 4 sec) . At this point S was asked to recall the digit 
string and then the sentence? No feedback was given to S 
concerning the correctness of his answers. This testing 
procedure continued until all 40 different digit string- 
sentence groups had been presented; S then completed a 
questionnaire to assess mnemonic strategies. 

Results and Discussion 

The initial analyses in this study utilized the three 
factors of Concreteness of the sentence (concrete or 
abstract sentences) , D igit L ength (one to five digits to be 
recalled) , and Clause Positio n ; the last factor is a 
repeated-measures factor . 

The first analyses showed a disturbing finding for the 
ten Ss in the five-digit group. They displayed a clear 
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inferiority in terms of their accuracy of recall of the 
digit^^^ (75% recalled correctly vs 87% for the average recall 
pert oriiianco of the other four groups) ; at the same time they 
displayed a vMch higher score for pioportion of words in the 
sentence recalled correctly (80o 70% for the other four 
conditions). Since the role of the digits was to occupy a 
fixed amount of memory capacity (within each group) , it 
initially appeared that this function was not being 
perform.od in the five-digit group. 

It seemed that there may have been an attentional shift 
on the part of some or all of the Ss in the five-digit group 
such that they v;ere not paying as much attention to the 
digit-recall part of the paradigm as they were to the 
sentences* Several attempts were made to correct for this 
possibility. The two Ss in each of the five digit-length 
groups who manifested poorest digit recall were deleted; 
then, the analyses were re-performed, omitting any trials in 
which recall of the digit strings was imperfect. The same 
findings emerged for the five-digit group: poorer recall on 
digits, better recall of the words of the sentence. 

A tentative decision was made at this time to omit the 
five-digit group from the design since it appeared that Ss 
in this group were somehow approaching the experimental task 
quite differently from Ss in the other four groups. At this 
point, the post-experimental questionnaires of all Ss were 
examined in an attempt to find a clue for this differential 
performance. Over 60% of the Ss in the five-digit group 
reported consistent use of segmentation and chunking 
procedures to aid in recall of the sentences (cf. Johnson, 
1969). It may be, then, that the five-digit condition, 
while depressing recall of the digits, facilitates recall of 
the sentences by driving Ss to a shift in recall strategy 
for the sentences. Regardless of the instructions, Ss may 
try to divide their attention equally between digits and the 
sentence — wher. the digit string becomes hard to handle due 
to length, Ss try to maintain sentence recall performance by 
adopting a ne/ chunking technique. There is some suggestion 
that attentional shift is not to Llame for the recall 
pattern in the five-digit group since the correlation 
between number of errors on digit recall and number of words 
recalled in the sentences is only .17. What then is the 
cause of this recall pattern? A partial explanation emerges 
if we compare the proportion of words recalled correctly (in 
the five-digit group) when the digit-string was recalled 
incorrectly to the number of words recalled when the digit 
string was correctly recalled. If an attentional shift was 
operating, then we should predict that more words would be 
recalled when the digits were misrecalled. This is not the 
case. KTien the digits were misrecalled, 76% of the words 
were recalled; when the digits were recalled perfectly, 81% 
of the words were recalled (t = 1. 79 , df = 9 , p - .05, one- 
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tailed). There appear to be two processes going on* Firsts 
on sop.o pre.soiit-ationi; tht.»re is a lapse of concentration^ and 
Ss nusrocali J:olh the digits and parts of the sentence. On 
those instaru't ; .S s ' rocai.1 of the sentences approximates the 
yaluo.j. lounci :'ar th.e othc^r four digi t-^groups (about 70%), 
On ot'aor occasions it laay be that Ss adopt a highly 
effective (but ]>055sib]y attention-straining) chunking 
stratocr^' for botli the digits and the sentences. This leads 
to perfect rocciil of the digits and good (81%) recall of the 
words of the sentence. Furakav/a, Suydam^ and Miller (1969) 
have demonstrated how a similar effect may be obtained by 
giving explicit grouping instructions to Ss, 

Since the hypothesis of an attentional shift in the 
five-digit condition cannot be completely ruled out, it was 
decided to leave the five-digit group in the design i")ut to 
analyze only those recall attempts (for all five groups) 
where the digit string was recalled correctly. All analyses 
must now be considered in the light of the fact that the 
five-digit group appears to be using different recall 
strategies than the other four groups. 

P ropo rtion of W ords Recalle d Cor rec tly 

There were two significant main effects. The words of 
concrete sentences were recalled better (84% recalled 
correctly) than the words of abstract sentences (61% 
recalled) (F = 45.14, df = 1/40, p < .001). As well, the 
length of the digit string had in effect (F = 3.78, df = 
4/40, p < .02). The proportions recalled correctly, going 
from digit length of one to five are: .74, .73, .76, .61, 
.81. 

Propor tion of Sentences Recalled Perfectly 

Concrete sentences were recalled perfectly 32% of the 
time while abstract sentences were recalled perfectly only 
8% of the time (F = 39.4, df = 1/40, < .001). 

There was also a significant interaction between Digit 
Length and Clause Positi on (F = 2.72, df = 4/40, g» < .05). 
However, this interaction is overshadowed by the significant 
three-way interaction involving these two factors and 
C oncretenes s (F = 2.68 , df = 4/40, £ < . 05) . This 
Interaction is presentedTn Table 3-1 and Fig. 3-1. 



Insert Table 3-1 and Fig. 3-1 about here 



There are several very interesting effects eviden*- from 
inspection of Fig. 3-1. Look first at the abstract sentence 
curves. First, there is no essential difference in 
performance as a function of placement of the subordinate 
clause; the two curves overlap almost completely. Second, 
the ease of recall is ordered thus: five-digit group, then 
three-digit group, then the other three groups. 
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Table 3-1* 



Interaction of D igit Length y Clause Position y 
and Concreteness for proportion of sentences" 
recalled perfectly 



Concrete 



Abstract 



Digit 
Leng th 





Clause in 
middle of 
sentence 


Clause at 
end of 
sentence 


Clause in 
middle of 
sentence 


Clause at 
end of 
sentence 


1 


.49 


.36 


.06 


..02 


2 


.32 


.36 


.02 


.04 


3 


.21 


.28 


.13 


.14 


4 


.12 


.28 


.04 


.02 


5 


.43 


.32 


.17 


.19 
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Quito a di£fCM"CMit picture omorc]os when wo oxaniinc tne 
curves for concrt^tc sontoiiccs. No loncjer is thcro a clear 
similarity of por : orinancc^ for the two positions of clause 
placoi.u-^rit . '^Ih^'-A 1 hi} clause is at the end of the sentence^ 
recall is alnc:;L u::ifonn for the fivo-digit cjroups. 
However^ wl^on the. clause ir> in the middle of the sentence^ 
the niuubor of sontor.cos recalled perfectly correctly 
steadily declines from the one-dicjit group to the four-digit 
group. The fivc-dicjit group is much better^ almost as good 
as the one-digit group. 

It would appoar that the insertion of the subordinate 
clause bct'.vven the subject noun and the main verb has a 
substantial effect for concrete sentences. One possibi3.ity 
is that this manipulation effectively destroys the 
psychological unity of the whole-sentence idea (in concrete 
sentences) and thus interferes with recall (or storage) • No 
such effect opercites with abstract sentences since it is 
assumed that • he mode of memory representation is more 
closely related to word-to-word relations. 

It must still be explained why thi3 hypothesized 
disruptive effect of subordinate clause placement in 
concrete sentences interacts with number of digits to be 
remembered. In line with the earlier discussion the five- 
digit group is probably using an entirely different form of 
processing strategy. But what explains the steady decrease 
in recall performance from the one-digit case to the four- 
digit case? 

A possible explanation emerges if we assume that the 
length of the digit-string is having the desired effect of 
loading up memory^ at least in the cases where the string 
length is less than five. If the placement of the 
subordinate clause in mid-sentence is disruptive (in 
concrete sentences) to the formation of global images, S may 
elect to store the clause in su-jh a way as to minimize this 
effect. It may be stored as a subsidiary image or tag, but 
it must be invested with the appropriate order information 
for it to be recalled in the proper place in the sentence. 
This order information is also a component of memory load in 
a fixed-capacity system. Thus, increasing the digit length 
will increase the storage requirements for the sequence even 
more . 

Boakes and Lodwick (1971) have suggested a different 
locus for this effect. The increase in digit-length may 
have its effect not on the fixed-capacity system as we 
assume, but rather the effect may be due to the increasing 
difficulty of recalling digit strings of greater length. It 
is the interfering effect of the recall procedure itself, 
they suggest, that decreases recall of the sentence. Either 
way the effect in the present study is a relatively weak, 
though interesting, one. The interaction did not occur when 
we analyzed proportion of words correct. As well/ it is 
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difficult to specify the exact nature of the decline in 
pcrforn^anco. }t is not (as we shall see later) a simple 
lo-jii of the oi-Jru' infc^riiAtion (ro: the subordinate clause) 
since the effect does nob appear when we analyze for errors 
of clause plac^s^ont. 

Ch un Id na 

There v/ere three main questions related to chunking 
asked at the booinning oi this study: 1) As memory load 
increases, do r.ubordlnate clauses become subunits of verb 
chunks? 2) Does concretoness of sentences facilitate the 
formation of super-chunks (unitization)? 3) Does 
concrcteness result in confusion of placement of subordinate 
clauses? 

The answer to 3) is an unequivocal "No*" There was no 
significant difference between concrete and abstract 
sentences for confusion in placement of the subordinate 
clause (F = .83; df = 1/40). 

To answer quescions 1) and 2) it is necessary to 
examine both the clustering data and the TEP data. The 
Z' analysis shows that concrete sentences are better 
clustered 'Z* = .79) than abstract sentences (Z^ = .60) (F = 
22.88, df = 1/40, p < .001). This gives a qualified "Yes" 
to 2) in that it appears that concrete sentences are 
recalled in a more veridical order than the abstract 
sentences. Flowever, to verify this answer for 2) and to 
find the answer for 1) , it is necessary to examine the TEP 
patterns in recall. 

In the sentence, "The national election, since the 
results were clear, indicated security for the country," two 
major grammatical boundaries are follov/ed at transitions 3 
to 8 by the words "indicated" and "since." This is also 
true when the subordinate clause is placed at the end of the 
sentence. First, the mean TEP for each of the twelve 
transitions in the sentences was calculated for each S for 
each of the cwo possible clause placement conditions* If we 
form a ratio of the mean TEPs at these two major boundaries, 
divided by the mean TEPs of the other ton transitions, the 
resulting measure describes the degree to which S is 
treating the three main grammatical sections as Independent 
units. The higher this ratio, the less the sentence is 
being treated as a unitized whole; the lov;er this ratio, the 
more S is treating a between-phrase or between-clause 
transition like a within-phrase transition. This ratio 
turns out to be 2.8:1 on the average. However, an analysis 
of variance using the factors employed above failed to yield 
any significant effects. 

It appears then that an increase in memory load (at 
least with the technique employed in this study) does not 
facilitate the formation of superchunks (as assessed by the 
TEP pdtterns at clause and phrase boundaries). It also 

47 



ERLC 



55 



cippears tivAt t*he cc>ncrctcncv-;s of the sentences does not have 
this effect eitlior althoiujli concretcness does liave a strong 
effect ori recall in correct: order fo)* the sentences (and 
nu:/.:jor Q\. woruj t\ c.-ailod) ♦ Tlio question of the role of 
concretcness in dc^teriuinincj ordered recall will be examined 
in more detail in tiie next oxpcriiuent. 
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Expo rimon t 4^ 

A Ira.t iiurorl ant quor. t^'on v.'hich has generated little 
research is "ilov; are chunl:s kept ordered in memory, and what 
dcterr.ino:; order of production in recall?" Is it some kind 
of Iciv-to-right pi'ocessincj or primacy effect (cf. Johnson- 
Laird, j9GS)? Since the voj'b is typically the poorest 
recalled element in "The sub joct verbod the object" 
r>entencc5*> , it might even bo a serial effect. Hov;ever, this 
left-to-ri.ght emission might be superimposed on the sentence 
elements after recall or retrieval from memory. Johnson's 
(1972) nodel suggests two v.-ays in v/hich order infomation is 
preserved. First, by the very nature of the decoding steps, 
only certain elements are available for read--out from memory 
at any given time. Second, he hypothesizes that each 
element is tagged for order information as regards its 
position in the chunk and sequence of chunks. 

This experiment is designed to answer two questions* 
There is the possibility that the left-to-right emission of 
sentence elements may not, in fact, be the order in which 
these elements were recovered from memory* This prediction, 
of course, is exactly opposite to that generated by 
Johnson's (1970) decoding-operation model. To explore this 
possibility, interchangeable subject and object phrases will 
be used in a recognition task. To the extent that material 
is not recalled in a lef t-to--right order, phrases in the 
subject position and object position should be often 
interchanged upon recall. This would imply that the order 
typically observed in emission is due to the organizing 
effects of grammar and not to any ordering during storage. 
V7e should expect any confusion to disappear when a valuable 
cue (the "by" element in the passive voice) is added. 
Although the present experiment is not designed as a direct 
test of predictions from Johnson's model regarding order 
information, nonetheless the results may have some important 
consequences for the model. 

The second problem considered here involves the 
question of what is actually stored before the recognition 
trial. By manipulating the ways in which parts of 
distractor phrases are constructed, further information 
concerning hov; Ss store concrete and abstract words 
(semantically or by surface appearance) will be obtained. 

There is a possibility that chunks may be "available" 
in memory but not "accessible" to recall unless some 
retrieval cue is present (VJood, 1969). Is one word of the 
chunk the best cue? Is this one word a noun (in a noun 
phrase)? If this is the case, it would be an example of 
"redintegrative m.emory" (Horov/itz & Prytulak, 3969). 
Horov/itz and Prytulak presented convincing evidence that 
nouns have some special importance in memory for sentences. 
In phrases the noun is the better cue than an adjective 
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(Lockhart, I960); as well, in free recall the noun is better 
rcca]lod th.an the adjecl-ive {Horov;it2 & Prytulak, 1969). 
Tliev f'.'cl th.\l ihc nouii i the "nost salient'* fragment. 
Note, hov.'ovcr, that most stimuli used were concrete, and 
Bocjg and Paivio have oifc^red a more appealing explanation of 
the ease of roc.ill of noi:ns in concrete sentences. 

Anderson (3 95 3) presented data on the ease of recall in 
entire sentences of the type "The subject verbed the 
ob jr-ct ♦ " He foand that subject nouns were recalled best, 
then object nouns , tiion verbs. Some additional data on this 
issue come fro'n rxjynolds (1970) • In this study Ss had to 
transform activo-af f irmative , active-negative, passive- 
affirmative/ and passive-negative sentences. The Ss seemed 
to rely heavily on what V7right (1968) called key w ords 
(verb, object, and especially the subject). The propensity 
to do this increased as short-term memory load was 
increased. Horowitz and Prytulak have tied these and other 
findings together by shovjing that order of recall of an item 
and efficiency of that item as a cue are perfectly 
correlated. 

Some additional information on the cue-value of nouns 
comes from a study by Roberts (1968). He found that 
bidirectional Subject-Object associational linkages aided 
sentence recall. When one chunk v/as correctly recalled, the 
noun in the phrase could act as a mediated cue for the other 
phrase. 

Dolinskyaid Michael (1969) have offered a different 
explanation for findings concerning the efficacy of the noun 
as cue. They argue that the noun has special importance in 
word phrases because of its role as a post-integration 
mechanism. Uhlenbeck's (1963) "principle of sustained 
memory" suggested that the elements in a phrase such as 
articles and adjectives may remain unconnected until a later 
element (e.g., the head noun) organizes them. Some 
empirical support for this argument was offered by Dolinsky 
and Michael (1969) and by Rommetveit and Turner (1967). 

Quite a few factors seem important suspects as 
efficient cues for chunk retrieval. Nouns seem clearly to 
have a special role as mediators for phrase recall. As 
well, nouns appear to play different roles for concrete and 
abstract sentences. The effective relationship between 
nouns and their adjectives may be affected by several 
factors among which is the location of the noun phrase 
(subject or object). A similar set of relationships may 
occur v;ith verbs as cues. Verbs may well function 
differently (as cues) depending on the voice of the 
sentence . 

>fethod 

Subjects. A total of 80 Ss participated in the study; 32 Ss 
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served in the experiment proper while 48 Ss helped generate 
stimulus senLencoB . 

Donian. The c^::; >^jriiaent used a recognition inemory paradigm 
wno"ro'''iive l:c:i tonc'.^s \:?r^.^ presented in a block for S to 
study; after au interpolated activity^ a phrase (either 
unchancjod or aitcred) from one of the five sentences was 
presented to S v;ho had to signify if he had seen this phrase 
in tiio sariplo of five sentences. If §. felt he had seen the 
phrajo, he was asked to recall as much of that sentence as 
possible. The experiment was run as a factorial design with 
tv/o independent factors and three repeated-measures factors. 
The tv/o independent factors were Concreten ess (concrete or 
abstract sentences) ^ and Voice (sentences were either active 
or passive). Eight Ss V7ere"*tested in each of the four cells 
formed by the two independent factors. 

There were three repeated-measures factors ; the last 
tv/o of them define the types of alteration possible for 
changed sentences. The cue Phrase Presented could be either 
the subject phrase^ verb phrase, or object phrase; the Part 
of Speech Changed could be either the modifier or head in 
the phrase; the Type of C hange could be a change to a 
synonym or to an unrelated~\vord. 

Materials . All sentences had the general form Article , 
Adjecti ve , Noun , Adverb , Verb, Ar ticle , Adject ive , Noun 
(e.g., 'rTTe^ deep anger quickly caused the noisy outburst ) . 
All sente'nces were reversible ; "tn'at^i s , the subject and 
object phrases could be interchanged without creating a 
nonsense sentence . 

A group of student volunteers generated a pool of over 
300 abstract sentences of the form described above. These 
were examined by E, and any sentences not meaningfully 
reversible (or otherwise inappropriate) were discarded. A 
few more random discards pared the pool to 24 0 sentences. 
From this pool, 4 8 sentences were randomly chosen as test 
sentences; the remaining 192 sentences were designated as 
fillers. To control for any semantic biases generated 
during the construction of the sentences, every second 
sentence had the subject phrase and object phrase reversed* 

A computer program selected nouns, verbs, adjectives, 
and adverbs from a pool of concrete words to generate 
sentence frames that matched the 4 8 abstract sentences word- 
for-word on Thorndike-Lorge values (with a tolerance of ±5 
on frequency). The 192 concrete filler sentences were 
gathered from the scime subjects who produced all the 
abstract sentences . 

After the concrete and abstract sentences had been 
constructed, each sentence was transformed into the passive 
form and checked for meaningf ulness . Half the Ss wece 
presented with the active form of each sentence, half with 
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the passive form. 

Allorod versions of 24 of the tost sentences of each 
kind {ccn<r~oto and ahstracl:) v;ere constructed in the 
followincj i.'tannc)% r.ight alternates had a change performed 
in the ^JU^^^'-Ct phra^-^o, eight in the verb phrase, and eicjht 
in the objv^ct phrase^ (^1^2. ^I-IL'IS.?. P re r. o n ted factor). Within 
each group of ciqh.t altornatO' sentences, four had the 
raodifier (adjective^ or adverb) changed, and four had the 
head (noun or verb) altered (Part of Sp eech Changed factor). 
V.ithin each of theiso four sentences, tv;o of the changes 
involved replacement by synonyms; two of the changes 
involved extraneous (neither' synonymous nor antonymous) 
v/ords. All replaccraont words were matched to the original 
v;ords as closely as possible on Thorndike-Lorge frequency. 

There are thus 12 possible changes that could be made 
in one of the test sentences. Since there were two 
exemplars of each of these 12 types constructed, and each 
exemplar had an unaltered counterpart sentence, one half of 
the unaltered test sentences were treated as if they w^re 
altered ("new") sentences for purposes of controlling any 
inadvertent biases in the construction of the altered 
sentences. Thus, one half of the original sentences were 
moved to the altered pool, and their altered counterparts 
were treated as if they were the original sentences. 

Procedu re. Testing of Ss took place at a computer teletype 
connected to the Dartmouth Time-Sharing System; after S was 
seated at the teletype and had read the instructions, a 
block of five sentences came into view. One of these five 
was the target sentence; four were fillers.^ This block was 
exposed for twenty seconds after which S was presented with 
a letter of the alphabet. S was instructed to say the 
alphabet backwards from this start-point. After five 
seconds of alphabet-pronunciation, S was presented with a 
cue phrase which may or may not have appeared in one of the 
sentences in the preceding block of five. Half the time, 
the cue phrase was identical to a phrase presented in one of 
the preceding five sentences; half the time, the phrase had 
been altered in some manner. 5 

After the cue phrase was presented, S was asked to 
decide whether he had seen this phrase in the preceding five 
sentences (Yes/No). G s was also asked to enter a judgment 
of his confidence, on a scale from 1-5, concerning the 
recognition decision. If S felt he had seen the phrase, he 
was asked to attempt to recall as much of the critical 
sentence as possible. 

Forty-eight blocks of sentences were presented to Ss. 
On half these trials, the cue phrase was identical to one of 
the phrases earlier presented; the phrase was "OLD." On the 
other 24 trials, the cue phrase was "NEW" — it represented a 
phrase that had been altered in some way. 
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Results and Discussion 



Tho first qu-.\stion to be answered is the degree to 
which th'.^ conricuiicj judcn-z^nts given by the Ss reflect their 
true knov/lodcjo of tiho 0LD/:]L:W status of the cue phrasers. 
V.'hen OLD ouc phra.sos are cxciniined, it is found that an 
increase in the proportion of correct judgments (saying OLD 
when in fact th.-i cue phrase is unchanged) increases smoothly 
from tb.e confidence category of 1 np to the 5 category • The 
relevant proper tioni^ are presented in Table 4-1. 

Hov/ever, with NllW cue phrases, no such smooth relation 
obtains. Only with a confidence judgment of 5 is there a 
marked increase above an essentially flat linear trend for 
categories 1 through 4. It appears that the confidence 
judgments in the OLD condition more accurately reflect Ss ' 
true knowledge of the 0LD/NEV7 status of the cues (cf. 
Wearing, 19 70). 



Insert Table 4-1 about here 



For this reason^ it was decided not to include the 
confidence judgments in the analyses of the accuracy scores 
but rather to use a d' statistic (derived from signal 
detection theory) for the OLD cue phrases and a simple 
measure of proportion correct for the NEW condition. 

OLD Cue Phrases 

They'd'' measure of signal detectabili ty may be 
considered as a type cf correction for guessing or response 
bias. There are a number of possible acquiescence biases or 
response sets which may be operating in a recognition memory 
experiment of this sort. Included might be 1) an overall 
predisposition to answer OLD, 2) idiosyncratic response 
biases of S s , and 3) bias towards OLD answers for particular 
sentence types. The d' statistic effectively counteracts 
these biases by treating the "OLD" answers (when the cue 
was in fact NEW) as a baseline noise level and then 
computing a measure of signal detectability for OLD cue 
phrases . 

Each Ss proportions correct (v;hen the cue phrase was 
OLD) and proportion incorrect (when the cue phrase was NEW) 
were converted to average d' scores. Each S obtained 3 d' 
scores , one for each of the three levels of the one 
repeated-measures factor in this analysis (Cue Phrase 
Present ed) . The other two factors included in the analysis 
v;ere Conc^r e tenes s and Voice . 

There were two significant effects in this analysis. 
Concrete cue phrases were more easily detectable as OLD than 
abstract cue phrases (F = 26.77, df = 1/28, g_ < .001). As 
well, the two noun phrases (subject phrase and object 
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Table 4-1. Proportion of times correct at various 
confidence levels 



1 

Old .32 
New . 72 



2 3 
.53 .58 
.62 .70 



4 5 
.62 .81 
.77 .85 
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phrase) wore i.oro often uctoctod correctly as OLD than the 
verb piiraso (j; 5.99, of = n < .005). The failure to 

obtain a siuiiTficant ofV'Tct for VoTc e is surprisincj but is 
in lino witn woarincj (1^70) . 

:nIJ\/ Cuo Phran ^^'s 

jiac proi;or lions correct for judging NEV7 phrases were 
analy::od in a 5-vay analysis of variance v;iiere the factors 
were Voice , C one . tonor-^s , Cuo Phrase Presentea , Part of 
Speeca Ca.in:^?''!^ ana rvj^ J oi ChancT'.^ ^ 

'I'wo sx(j:iri"itjant ram efiects were obtained. Concrete 
phrases wore correctly judged liLV; nore often (84%) than 
abstract phrar^cs (69:>; F = 11.39^ of = 1/28^ p < . 005). 
Use of a syncnyn to repTace a wordHTn the cue phrase was a 
more effective uis tractor than use of an unrelated word (.6 7 
vs .85 correct; £ = 34.72, df = 1/28, £ < .001). This 
erfect has been obtained previously by Bruder and Silverman 
(1972), Anisiela (1970), and Fillenbaun (1069). 

There v/ere three significant interactions. The iiieans 
for the significant interaction of Type of Change by Cue 
Phrase Presenteu are presented in Table 4-2 ana Fig. 4-1 (F 
= S.GG, u£ = 2/5G, £< .001). This result seems to indicate 
that tr.e subject: phrase is relatively impervious to faulty 
recognition. iio\;ever^ both the verb phrase and the object 
phrase are prone to be perceived as OLD when a synonym is 
used as the distractor. 



Insert Table 4-2 and Fig. 4-1 about here 



The means for the significant (F = 4.87, M 2/56, £ < 
.02) interaction of Part of Speech CKanged by Cue Phrase 
Presented are presented in Table 4-3 and Fig. 4-2. 



Insert Table 4-3 and Fig. 4-2 about here 



The significant interaction of Type of Change by Part of 
Speech Changed (F = 4.67, df = 1/2 8, £ < .05)' is presented 
in Table 4-4 and Fig. 4-3. 



Insert Table 4-4 and Fig. 4-3 about here 



Sentence Recall 

For both OLD and UZ\'1 cue phrases, if S felt the cue was 
OLD, he alr.ost invariably made sorre attempt to recall some 
of tiie associated sentence (98.8% of the time). Hov;ever, 
there is soi'.io inaication that: even though on all sentence 
recall attei.pts S perceived the cue phrases as OLD, there 
was soK.e differential recall. For truly OLD cue phrases, 8% 
of the sentence recall attempts produced only the cue phrase 
(or parts of it) vvrhile with WLVv cue phrases (mistakenly 
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Table 4-2 • Means for interaction of Cue Ph rase Presented by 
Typo of Change. Entries are raean proportTons correct (NEV/ 
phrases) • 



Type of Change 
Synonym Other 
Subject .74 .79 

Cue 

Phrase 
Presented 

Verb .56 .88 



Object 



.72 



.88 
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Tabic 4-3. Mean proportions correct (NEV7 phrases) for 

interaction of Part of Speech Changed by Cue 
Phra55e Presented ^ 

Part of Speech Changed 
Modifier Head 
Subject .79 .74 



Verb .74 .70 

Object .74 .86 



Cue 

Phrase 
Presented 
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Tabic 4-4. Mean proportions correct for interaction of Part 

of Speech Chan^jcd by Type of Chanoe (NEV? phrases) 

Part of Speech Changed 

Modifier Head 
Synonym • 65 • 70 

Other .87 .83 



Type of 
Change 
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called OLD), lG.9*i of the recall attorpts v/ero restricted to 
tiie presented cue pliraae or parts of it. 

Lo\.\'vox\ al:..::>r>c al\;:xyc> (98.5o of the tine) v/henever S 
h^aae any recall aLtei./ot: at all^ S included son-e of the cue 
phrase in iiis response. In. the analyses to follow the part 
of the recall attoi pt cor.prising the presented cue phrase 
v/ill be aeletod. As v;ell, to avoid "iny artifactual results^ 
the functor worui, tliat appeared constantly in every sentence 
("the" and "\/as/' an.d "by" in the paijsive forn) were 
oi-;.itled \:'Acn proportion of words recalled correctly was 
computed. 

Only those instances wiiere the cue phrase was 
unaltered (OLD), and S correctly perceived this, were 
includeu in the recall analysis. The design used tor the 
analysis of variance was the same three- factor design used 
for the analysis of the signal detection scores for OLD cue 
phrases. There v/as a significant main etfect for Cue Phrase 
Pre sen tea (F = 5.23, d£ = 2/56, £ < .009). V?hen the subject 
phrase v/as che cue, recall of the v;ords of the rest of the 
sentence was 39^. accurate, v;ith tre verb phrase as cue, 32% 
accuracy obtained; when the object phrase was cue, accuracy 
was only 27%. There was a significant main effoct for 
Concroter.ess (F ^ 15.35, of ="1/28, ^3 < .001). Concrete 
sentences were recalled better (42%) than abstract sentences 
(24%). 

There v;as also a significant interaction between Voice 
and Concreteness (F = 4.73, df = 1/28, £ < .04). This 
interaccion will not be discussed in light of the 
significant three-^way interaction between these two factors 
and Cue Phrase Presented (F = 3.60, df = 2/56, £ < .04). 

Insert Table 4-5 and Fig. 4-4 about here 



Examination of Fig. 4-4 reveals the striking locus of the 
interaction effect. Three groups have essentially similar 
patterns; however, abstract passive sentences display a 
deviant pattern. For this type of sentence subject phrases 
function as superior cues. This suggests that the 
superiority for subject phrase cues noted above is due 
aluost entirely to one type of sentence — abstract passive 
sentences. For concrete sentences and abstract active 
sentences each of the three phrases was reasonably equally 
efficacious as a cue. But for the cibstract passive 
sentences the subject phrase clearly excels. VJhy is this? 
If, as has been hypothesized, concrete sentences are stored 
as global ideas, then no one cue phrase should be more 
important than any other for recall. 

This does not seem to be the case for abstract 
sentences and particularly for abstract passive sentences. 
Here it appears that S focusses on the first phrase during 
presentation and treats the v/hole sentence in terns of tnis 
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Table 4-5. Interaction of Cue Phrase Presented by Concreteness 
by Voice — proportion of worus recalled correctly* 

Voice 

Active Passive 
Concrete Abstract Concrete Abstract 
Subject .43 .21 .34 .42 

Cue 

Phrase 

Presented 

Verb .45 .13 .39 .20 



Object .53 .20 .39 .26 
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ph>-aso. This typo of effect lias been noted previously 
(Tannenbaun Si 'Willinrn, 19GI]; Turner & Ron^metv.-it ^ 1968). 
Althouvj:; T;.^ rolat^vo use of active and passive sentences 
for ei pnuiu '<:incj UiTl orent types of information (e.g,. actor 
or patient.) is \;'jll-kno\:n^ tho present result is more 
coKiplicatcvu The focus on t!)e subject phrase which we might 
predict rorrarUless of voice appears only for the passive 
voice for cibstract sentences. 

?-iisplacod Phrases 

One ol tiie questions v;hich gave rise to this study 
involved the order of eirission of the subject and object 
phrases in reversible sentences. Tv/o predictions v;ere made: 
1) that since abstract sentences should be stored in a form 
nore closely resei idling the left-to-right order of surface 
structure f abstract sentence recall should manifest less 
conf usability of placerent of subject and object phrases at 
recall; and 2) there should be less conf usability v/hen the 
voice is passive and the cue phrase is the object phrase 
(due to the i!::portant elenent "by") . 

Neither of these predictins v;as supported by the 
results. Although there was a significant difference (F = 
6.00, uf^ = 1/28, p < .02) for probability of recall in the 
wrong piirase posicion betv;een concrete and abstract 
sentences, concrete sentences showed less conf usability of 
phrase placer.ent (.05) than abstract sentences (.14). 

The other main effect ( Cue Phrase Presented; F = 3.39, 
df = 2/56, p < .001) shov/ed that object phrase cues resulted 
in considerable confusion (<19) while subject and verb cues 
were less disruptive of order of recall (.04 and .06 for 
subject and verb cues, respectively). 

This latter result is particularly interesting. Why 
does the object phrase result in so much disruption of order 
of recall of noun piirases when it is used as a cue? It is 
not sii.iply because it is a noun phrase since the other noun 
phrase (subject phrase) did not have a similar effect. 
First, we know from the results above that when the object 
phrase is used as the cue, proportion of words correct is at 
its poorest. Thus, the object phrase is a poor probe for 
the entire sentence since it functions less effectively in 
eliciting the other components. This suggests that its 
relation to the other elements in memory storage is the 
weakest of tlie tliroe phrases. If this is the case, then 
when the object phrase is presented as a cue, it acts as 
information v;hich in some sense is less well integrated with 
the sentence than the other two components. This is the 
second pc.rt of the explanation. Since the object phrase is 
less attached, and it is the only information presented at 
recall test tire, there is a greater likelihood that its 
position (as subject or object) will be more often 
confused. 
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At thi.'i point v;o 5;iiould u.ikc sorae attenpt to integrate 
our results iii ilxp. 4. Tv/o rcilateci viev;s of the dynaraics of 
senteujc lourainvj a^.^^^rL that \;.ion proGO is learned^ the 
core rieani:i(;i or the soiitence iri extracted and rerKM^^bered 
(e.<j.^ Go:.ui?cki^ ID'yG) or that surrogate structures are 
lorr.ju ana r>l:v)red. Poi.^.pi and Lachnan (1967) proposed that 
these surrO'jace structures arc based in part on the word 
orviciT of tl;e text anci the associcitive relations ai ;ong \/ords 
but lict noce.'. warily containincj any direct copies or 
representations of either the actual words or the 
yrai^.L^atic^il structure of the rsentence. 

In Gor.ulicki* s (1956) statericnt of the "abstraction of 
core meaning" type of theory^ he posits that Ss onit 
uniraportant v/ords and roner±)er an action-agent-ef f ect unit. 
Goriulicki found that verbs were best rep.epbered^ then 
agonts, then objects* Our results run counter to this 
finaing. For OLD cuesr NEW cues, and the recall date^ the 
logical subject \7as alv/ays the r.iost pov;erful part of the 
sentence. It was least subject to confusion in recall and 
functioned as the best retrieval cue. In all cases ^ except 
for the analysis of OLD cue phrases, object phrases v/ere 
weakest. T:';:y wore i/ore oftei; false identified wiien a 
sligiit change \;as niade^ they vrore the poorest retrieval cue, 
and when used as a cue, resulted in the greatest nuiaber of 
subject-object displacenents at recall. 

This latter result strongly questions the completeness 
of a model that predicts the recall of ordered information 
based only on the surface structure relationships existing 
in a sequence. There appears to be another, more cognitive 
nechanisu operating. Given the pattern of results 
obtained, it seems that the sequential information contained 
in the sentences is stored in a surrogate form. Although 
the ner.ory representation may be partly hierarchical (as 
suggestea by Johnson) , it seens more likely that it is the 
meaningful dynar^ic relationships among the idea 
constituents of the sentence that more influence recall. 

Consistently throughout this study concrete sentences 
surpass abstract sentences on all measu. es of recallability . 
This finding is congruent with the earlier speculation — the 
critical process seens to be the method of memory storage 
and the rela wionshios a: ong the units in the store rather 
than the coding operations hypothesized to operate at 
retrieval. 
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General Conclusions 



Thir. ic ries of cxporinentn set out to exf^loro the 
implications of Johnuon^s docoding-opcration model of recall 
waoro tlie st:i:.ulus nnliorials \:ore meaningful English 
sentences or, several typos* Four different experimental 
paradigMs \;ero used: paired-cussociate learning, cued 
recall, recosjuition r..o:..oryr and shared-capacity recall. In 
all these studies only r>xight correspondence to the results 
predicted by a theory that seeks to explain performance 
based only on the phrase-struccure of the sentence was 
obtr.ined. It may be useful to backtrack for a moniant and 
re-analyse tiic assumptions under which these studies were 
planned. 

It is possible to conceptualize performance in an 
experinental study of memory as involving at least three 
processes. First, the process of learning itself. Learning 
is probably not the proper term to use in the present 
context since we are assuming that all the lexical items 
present in the sentences are meaningfully available to S, 
Rather, S is asked to temporarily consider some of the Ttems 
of his vocabulary more salient and make them more availcible 
for recall when asked to do so. As well, S is asked to 
retain certain order information concerning these lexical 
items . 

The second process involves the storage of these two 
types of information. Here v;e had been working under the 
implicit assumption that the type of storage was at least 
partly constrained or determined by the phrase-structure 
chunk units defined by the grammatical form of the 
sentence. 

Third, the process of recall or retrieval plays a very 
important role in performance. It is here that the decoding 
from memory storage occurs, and it is here that the 
decoding-operation model specifies a numJDer of predictions. 
However, as was pointed out in Experiment 1, a number of 
these predictions fail to be confirmed v;hen meaningful 
English sentences are used as the test materials. This is 
not to say that Ss do not make use of the phrase-structure 
of a sentence in comprehension and recall. This undoubtedly 
occurs (Johnson, 1965). The question we may ask at this 
point is: Is this all that occurs? A number of other 
processes seem to have more potency during the whole 
learning- recall process. 

The results of Experiment 4 clearly demonstrate the 
probable role of storage factors. Although the phrase- 
structure cues may have been used by S during comprehension 
of the sentences, it is clear that the sentences were stored 
in some form other than as words. 

Experiment 2 shows that there is little difference 
between age groups in how they organize outputs but clear 
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and striliin'-j difforjnces that could hct attributed to storage 
ir.ode. L:.\;jLirii..cMit 3 also obtaineci e^.r^cntially negative 
r^K ull.'. j.'or :/a:ii;>l r t: ons dv.'::Lc:ncd to affect degree of 
chun;:ing, vith oiio o;:ception. It appears that increasing 
the cry load to a certain point induces a chunking 
strategy on certain trials. 

There is a po.;nibility that a fourth set of processes 
laay be operating in our expcrir.ents. The studies were not 
dcsignoci to teijt prodictiono related to this possibility so 
any theorizing of the role of reconstructive p r-ocesses must 
remain luerely speculation. 

Reconstruction reform to the process by v/hich £ 
recreates the stiiuulus material at output in terms of his 
ov7n conception of the world. This is niost clearly seen in 
cxpcriiv^-^nts involving con;plP:x cognitive material such as 
stories / but it n;ay be expected to operate at the single- 
sentence level alr.o. Reconstruction^ of a kind^ can also 
operate at the storage stage. Unfamiliar or unusual 
matcriiils presented to Ss could be most efficiently stored 
as close approximations to already-known knowledge. 
Reconstruction can also be used as a partial explanation of 
the many effects described earlier for phrase-^structure 
divisions. The output during a memory experimtont may only 
coincidentally m.atch the input because the rules used in the 
construction of the stimuli may be the samie rules used by £ 
in his construction of the response. £ is not simply 
outputting the stored contents of memory — he is constructing 
an output (from a fov; stored pieces of information) . The 
structure and form of the response appear to have been 
stored but in reality may not have been. 

One clear set of findings in this series of studies 
need not rest on speculation. The variable of Concreteness- 
Abstractness has siiown itself to be a very powerful one in 
all the experiments. Concrete sentences were consistently 
recalled better than abstract ones regardless of the mode of 
measurement of recall. In some cases the superiority of 
concrete materials is embarrassing — in Experiment 4. A 
general theory of storage that predicts that abstract 
sentences are stored with greater correspondence to word-to- 
word order fails to predict the more accurate recall of the 
order of large elements in concrete sentences even when a 
conspicuous atter.pt was made to confuse Ss by the use of 
reversible sentences. This finding cannot be interpreted by 
any current theory of memory. Even a reconstructive 
approach would fail to predict this effect since there is no 
a priori reason why concrete reversible sentences should be 
^closer to experience" in the presented form than similar 
cibstract sentences . 

In sunuTvary^ the various results of the experiments lead 
us to a view of human recall (even in the relatively 
artificial situations used here) chat must emphasize at 
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least: four iuLeractiny processes — emphasis on phrase- 
structure anvl docodincj operation alone provides only a 
partial picture of the processGS at work. Doubtless, the 
phra^jo-structuros of the presented stimuli act as ready-made 
chunl;^^ useful for cor.prcJiension and storage. It is clear, 
hov/evor, that ceiaantic factors also influence storage. 
Finally, there is a hint that reconstructive processes may 
be used by S at recall time. 
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Footnotes 



1. A different set of nine two-digit numbers was generated 
for eacli S. 

2. The order of the two adjectives was counterbalanced 
across S s . 

3. Since there were two different adjectives associated 
with each sentence, when Ss were being tested in the one- 
ad jectivo condition, for lialf the Ss one adjective was used, 
and for the other Ss the other adjective was presented. 

4. 'ach target sentence was randomly selected from a pool 
of sentences; the filler sentences were selected from a 
second pool. Each possible filler sentence was examined to 
ensure that it had no v;ords (other than "the'') in common 
with the target sentence in th/^t block. 

5. The decision regarding v;hether a cue phrase should be 
the same or altered was performed randomly by the computer 
with the restriction that no one type be presented more than 
four times in succession. 

6. While answering, S was prevented from seeing the block 
of five sentences by a cardboard screen attached to the top 
and rear of the teletype. 
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